Polymeric dyes with linker groups comprising deoxyribose

ABSTRACT

Compounds useful as fluorescent or colored dyes are disclosed. The compounds have the following structure (I): (I) or a stereoisomer, tautomer or salt thereof, wherein R1, R2, R3, R4, R5, L1a, L1b, L2, L3, L4, L5, L6, L7, M1, M2, q, w, m and n are as defined herein. Methods associated with preparation and use of such compounds are also provided.

BACKGROUND Field

The present disclosure is generally directed to polymeric fluorescent or colored dyes having deoxyribose linker groups, and methods for their preparation and use in various analytical methods.

Description of the Related Art

Fluorescent and/or colored dyes are known to be particularly suitable for applications in which a highly sensitive detection reagent is desirable. Dyes that are able to preferentially label a specific ingredient or component in a sample enable the researcher to determine the presence, quantity and/or location of that specific ingredient or component. In addition, specific systems can be monitored with respect to their spatial and temporal distribution in diverse environments.

Fluorescence and colorimetric methods are extremely widespread in chemistry and biology. These methods give useful information on the presence, structure, distance, orientation, complexation and/or location for biomolecules. In addition, time-resolved methods are increasingly used in measurements of dynamics and kinetics. As a result, many strategies for fluorescence or color labeling of biomolecules, such as nucleic acids and protein, have been developed. Since analysis of biomolecules typically occurs in an aqueous environment, the focus has been on development and use of water soluble dyes.

Highly fluorescent or colored dyes are desirable since use of such dyes increases the signal to noise ratio and provides other related benefits. Accordingly, attempts have been made to increase the signal from known fluorescent and/or colored moieties. For example, dimeric and polymeric compounds comprising two or more fluorescent and/or colored moieties have been prepared in anticipation that such compounds would result in brighter dyes. However, as a result of intramolecular fluorescence quenching, the known dimeric and polymeric dyes have not achieved the desired increase in brightness.

There is thus a need in the art for dyes having an increased molar brightness. Ideally, such dyes and biomarkers should be intensely colored or fluorescent and should be available in a variety of colors and fluorescent wavelengths. The present disclosure fulfills this need and provides further related advantages.

BRIEF SUMMARY

In brief, embodiments of the present disclosure are generally directed to compounds useful as water soluble, fluorescent and/or colored dyes and/or probes that enable visual detection of analyte molecules, such as biomolecules, as well as reagents for their preparation. Methods for visually detecting analyte molecules using the dyes are also described.

Embodiments of the presently disclosed dyes include two or more fluorescent and/or colored moieties covalently linked by linkers (e.g., “L²”, “L³”, “L⁴”, “L⁵” and “L⁶”). In contrast to previous reports of dimeric and/or polymeric dyes, the present dyes are significantly brighter than the corresponding monomeric dye compound. While, not wishing to be bound by theory, it is believed that the linker moiety provides sufficient spatial separation between the fluorescent and/or colored moieties such that intramolecular fluorescence quenching is reduced and/or eliminated.

The water soluble, fluorescent or colored dyes of embodiments of the disclosure are intensely colored and/or fluorescent and can be readily observed by visual inspection or other means. In some embodiments the compounds may be observed without prior illumination or chemical or enzymatic activation. By appropriate selection of the dye, as described herein, visually detectable analyte molecules of a variety of colors may be obtained.

In one embodiment, compounds having the following structure (I) are provided:

or a stereoisomer, tautomer, or salt thereof, wherein R¹, R², R³, R⁴, R⁵, L^(1a), L^(1b), L², L³, L⁴, L⁵, L⁶, L⁷, M¹, M², q, w, m and n are as defined herein. Compounds of structure (I) find utility in a number of applications, including use as fluorescent and/or colored dyes in various analytical methods.

In another embodiment, a method for staining a sample is provided, the method comprises adding to said sample a compound of structure (I) in an amount sufficient to produce an optical response when said sample is illuminated at an appropriate wavelength.

In still other embodiments, the present disclosure provides a method for visually detecting an analyte molecule, comprising:

(a) providing a compound of structure (I); and

(b) detecting the compound by its visible properties.

Other disclosed methods include a method for visually detecting a biomolecule, the method comprising:

(a) admixing a compound of structure (I) with one or more biomolecules; and

(b) detecting the compound by its visible properties.

Other embodiments provide a method for visually detecting an analyte, the method comprising:

(a) providing a compound as disclosed herein, wherein R¹ or R² comprises a linker comprising a covalent bond to a targeting moiety having specificity for the analyte;

(b) admixing the compound and the analyte, thereby associating the targeting moiety and the analyte; and

(c) detecting the compound by its visible properties.

Other embodiments are directed to a composition comprising a compound of structure (I) and one or more analyte molecule, such as a biomolecule. Use of such compositions in analytical methods for detection of the one or more biomolecules is also provided.

These and other aspects of the disclosure will be apparent upon reference to the following detailed description.

BRIEF DESCRIPTION OF THE DRAWINGS

In the figures, identical reference numbers identify similar elements. The sizes and relative positions of elements in the figures are not necessarily drawn to scale and some of these elements are enlarged and positioned to improve figure legibility. Further, the particular shapes of the elements as drawn are not intended to convey any information regarding the actual shape of the particular elements, and have been solely selected for ease of recognition in the figures.

FIG. 1 shows a PAGE gel of Compound I-1 and comparative compounds

FIG. 2 shows a PAGE gel of Compound I-2 and comparative compounds

DETAILED DESCRIPTION

In the following description, certain specific details are set forth in order to provide a thorough understanding of various embodiments of the disclosure. However, one skilled in the art will understand that the disclosure may be practiced without these details.

Unless the context requires otherwise, throughout the present specification and claims, the word “comprise” and variations thereof, such as, “comprises” and “comprising” are to be construed in an open, inclusive sense, that is, as “including, but not limited to”.

Reference throughout this specification to “one embodiment” or “an embodiment” means that a particular feature, structure or characteristic described in connection with the embodiment is included in at least one embodiment of the present disclosure. Thus, the appearances of the phrases “in one embodiment” or “in an embodiment” in various places throughout this specification are not necessarily all referring to the same embodiment. Furthermore, the particular features, structures, or characteristics may be combined in any suitable manner in one or more embodiments.

“Amino” refers to the —NH₂ group.

“Carboxy” refers to the —CO₂H group.

“Cyano” refers to the —CN group.

“Formyl” refers to the —C(═O)H group.

“Hydroxy” or “hydroxyl” refers to the —OH group.

“Imino” refers to the ═NH group.

“Nitro” refers to the —NO₂ group.

“Oxo” refers to the ═O substituent group.

“Sulfhydryl” refers to the —SH group.

“Thioxo” refers to the ═S group.

“Alkyl” refers to a straight or branched hydrocarbon chain group consisting solely of carbon and hydrogen atoms, containing no unsaturation, having from one to twelve carbon atoms (C₁-C₁₂ alkyl), one to eight carbon atoms (C₁-C₈ alkyl) or one to six carbon atoms (C₁-C₆ alkyl), and which is attached to the rest of the molecule by a single bond, e.g., methyl, ethyl, n-propyl, 1-methylethyl (iso-propyl), n-butyl, n-pentyl, 1,1-dimethylethyl (t-butyl), 3-methylhexyl, 2-methylhexyl, and the like. Unless stated otherwise specifically in the specification, alkyl groups are optionally substituted.

“Alkylene” or “alkylene chain” refers to a straight or branched divalent hydrocarbon chain linking the rest of the molecule to a radical group, consisting solely of carbon and hydrogen, containing no unsaturation, and having from one to twelve carbon atoms, e.g., methylene, ethylene, propylene, n-butylene, ethenylene, propenylene, n-butenylene, propynylene, n-butynylene, and the like. The alkylene chain is attached to the rest of the molecule through a single bond and to the radical group through a single bond. The points of attachment of the alkylene chain to the rest of the molecule and to the radical group can be through one carbon or any two carbons within the chain. Unless stated otherwise specifically in the specification, alkylene is optionally substituted.

“Alkenylene” or “alkenylene chain” refers to a straight or branched divalent hydrocarbon chain linking the rest of the molecule to a radical group, consisting solely of carbon and hydrogen, containing at least one carbon-carbon double bond and having from two to twelve carbon atoms, e.g., ethenylene, propenylene, n-butenylene, and the like. The alkenylene chain is attached to the rest of the molecule through a single bond and to the radical group through a double bond or a single bond. The points of attachment of the alkenylene chain to the rest of the molecule and to the radical group can be through one carbon or any two carbons within the chain. Unless stated otherwise specifically in the specification, alkenylene is optionally substituted.

“Alkynylene” or “alkynylene chain” refers to a straight or branched divalent hydrocarbon chain linking the rest of the molecule to a radical group, consisting solely of carbon and hydrogen, containing at least one carbon-carbon triple bond and having from two to twelve carbon atoms, e.g., ethenylene, propenylene, n-butenylene, and the like. The alkynylene chain is attached to the rest of the molecule through a single bond and to the radical group through a double bond or a single bond. The points of attachment of the alkynylene chain to the rest of the molecule and to the radical group can be through one carbon or any two carbons within the chain. Unless stated otherwise specifically in the specification, alkynylene is optionally substituted.

“Alkylether” refers to any alkyl group as defined above, wherein at least one carbon-carbon bond is replaced with a carbon-oxygen bond. The carbon-oxygen bond may be on the terminal end (as in an alkoxy group) or the carbon oxygen bond may be internal (i.e., C—O—C). Alkylethers include at least one carbon oxygen bond, but may include more than one. For example, polyethylene glycol (PEG) is included within the meaning of alkylether. Unless stated otherwise specifically in the specification, an alkylether group is optionally substituted. For example, in some embodiments an alkylether is substituted with an alcohol or —OP(═R_(a))(R_(b))R_(c), wherein each of R_(a), R_(b) and R_(c) is as defined for compounds of structure (I).

“Alkoxy” refers to a group of the formula —OR_(a) where R_(a) is an alkyl group as defined above containing one to twelve carbon atoms. Unless stated otherwise specifically in the specification, an alkoxy group is optionally substituted.

“Alkoxyalkylether” refers to a group of the formula —OR_(a)R_(b) where R_(a) is an alkylene group as defined above containing one to twelve carbon atoms, and R_(b) is an alkylether group as defined herein. Unless stated otherwise specifically in the specification, an alkoxyalkylether group is optionally substituted, for example substituted with an alcohol or —OP(═R_(a))(R_(b))R_(c), wherein each of R_(a), R_(b) and R_(c) is as defined for compounds of structure (I).

“Heteroalkyl” refers to an alkyl group, as defined above, comprising at least one heteroatom (e.g., N, O, P or S) within the alkyl group or at a terminus of the alkyl group. In some embodiments, the heteroatom is within the alkyl group (i.e., the heteroalkyl comprises at least one carbon-[heteroatom]_(x)-carbon bond, where x is 1, 2 or 3). In other embodiments, the heteroatom is at a terminus of the alkyl group and thus serves to join the alkyl group to the remainder of the molecule (e.g., M1-H-A), where M1 is a portion of the molecule, H is a heteroatom and A is an alkyl group). Unless stated otherwise specifically in the specification, a heteroalkyl group is optionally substituted. Exemplary heteroalkyl groups include ethylene oxide (e.g., polyethylene oxide), optionally including phosphorous-oxygen bonds, such as phosphodiester bonds.

“Heteroalkoxy” refers to a group of the formula —OR_(a) where R_(a) is a heteroalkyl group as defined above containing one to twelve carbon atoms. Unless stated otherwise specifically in the specification, a heteroalkoxy group is optionally substituted.

“Heteroalkylene” refers to an alkylene group, as defined above, comprising at least one heteroatom (e.g., Si, N, O, P or S) within the alkylene chain or at a terminus of the alkylene chain. In some embodiments, the heteroatom is within the alkylene chain (i.e., the heteroalkylene comprises at least one carbon-[heteroatom]-carbon bond, where x is 1, 2 or 3). In other embodiments, the heteroatom is at a terminus of the alkylene and thus serves to join the alkylene to the remainder of the molecule (e.g., M1-H-A-M2, where M1 and M2 are portions of the molecule, H is a heteroatom and A is an alkylene). Unless stated otherwise specifically in the specification, a heteroalkylene group is optionally substituted. Exemplary heteroalkylene groups include ethylene oxide (e.g., polyethylene oxide) and the “C,” “HEG,” and “PEG 1K” linking groups illustrated below:

Multimers of the above C-linker, HEG linker and/or PEG 1K linker are included in various embodiments of heteroalkylene linkers. In some embodiments of the PEG 1K linker, n ranges from 19-25, for example n is 19, 20, 21, 22, 23, 24, or 25. Multimers may comprise, for example, the following structure:

wherein x is 0 or an integer greater than 0, for example, x ranges from 0-100 (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10).

“Heteroalkenylene” is a heteroalkylene, as defined above, comprising at least one carbon-carbon double bond. Unless stated otherwise specifically in the specification, a heteroalkenylene group is optionally substituted.

“Heteroalkynylene” is a heteroalkylene comprising at least one carbon-carbon triple bond. Unless stated otherwise specifically in the specification, a heteroalkynylene group is optionally substituted.

“Heteroatomic” in reference to a “heteroatomic linker” refers to a linker group consisting of one or more heteroatoms. Exemplary heteroatomic linkers include single atoms selected from the group consisting of O, N, P and S, and multiple heteroatoms for example a linker having the formula —P(O—)(═O)O— or —OP(O—)(═O)O— and multimers and combinations thereof.

“Phosphate” refers to the —OP(═O)(R_(a))R_(b) group, wherein R_(a) is OH, O— or OR_(c); and R_(b) is OH, O—, OR_(c), a thiophosphate group or a further phosphate group, wherein R_(c) is a counter ion (e.g., Na⁺ and the like).

“Phosphoalkyl” refers to the —OP(═O)(R_(a))R_(b) group, wherein R_(a) is OH, O— or OR_(c); and R_(b) is —Oalkyl, wherein R_(c) is a counter ion (e.g., Na⁺ and the like). Unless stated otherwise specifically in the specification, a phosphoalkyl group is optionally substituted. For example, in certain embodiments, the —Oalkyl moiety in a phosphoalkyl group is optionally substituted with one or more of hydroxyl, amino, sulfhydryl, phosphate, thiophosphate, phosphoalkyl, thiophosphoalkyl, phosphoalkylether, thiophosphoalkylether or —OP(═R_(a))(R_(b))R_(c), wherein each of R_(a), R_(b) and R_(c) is as defined for compounds of structure (I).

“Phosphoalkylether” refers to the —OP(═O)(R_(a))R_(b) group, wherein R_(a) is OH, O— or OR_(c); and R_(b) is —Oalkylether, wherein R_(c) is a counter ion (e.g., Na⁺ and the like). Unless stated otherwise specifically in the specification, a phosphoalkylether group is optionally substituted. For example, in certain embodiments, the —Oalkylether moiety in a phosphoalkylether group is optionally substituted with one or more of hydroxyl, amino, sulfhydryl, phosphate, thiophosphate, phosphoalkyl, thiophosphoalkyl, phosphoalkylether, thiophosphoalkylether or —OP(═R_(a))(R_(b))R_(c), wherein each of R_(a), R_(b) and R_(c) is as defined for compounds of structure (I).

“Thiophosphate” refers to the —OP(═R_(a))(R_(b))R_(c) group, wherein R_(a) is O or S, R_(b) is OH, O—, S—, OR_(d) or SR_(d); and R_(c) is OH, SH, O—, S—, OR_(d), SR_(d), a phosphate group or a further thiophosphate group, wherein R_(d) is a counter ion (e.g., Na⁺ and the like) and provided that: i) R_(a) is S; ii) R_(b) is S— or SR_(d); iii) R_(c) is SH, S— or SR_(d); or iv) a combination of i), ii) and/or iii).

“Thiophosphoalkyl” refers to the —OP(═R_(a))(R_(b))R_(c) group, wherein R_(a) is O or S, R_(b) is OH, O—, S—, OR_(d) or SR_(d); and R_(c) is —Oalkyl, wherein R_(d) is a counter ion (e.g., Na⁺ and the like) and provided that: i) R_(a) is S; ii) R_(b) is S— or SR_(d); or iii) R_(a) is S and R_(b) is S— or SR_(d). Unless stated otherwise specifically in the specification, a thiophosphoalkyl group is optionally substituted. For example, in certain embodiments, the —Oalkyl moiety in a thiophosphoalkyl group is optionally substituted with one or more of hydroxyl, amino, sulfhydryl, phosphate, thiophosphate, phosphoalkyl, thiophosphoalkyl, phosphoalkylether, thiophosphoalkylether or —OP(═R_(a))(R_(b))R_(c), wherein each of R_(a), R_(b) and R_(c) is as defined for compounds of structure (I).

“Thiophosphoalkylether” refers to the —OP(═R_(a))(R_(b))R_(c) group, wherein R_(a) is O or S, R_(b) is OH, O—, S—, OR_(d) or SR_(d); and R_(c) is —Oalkylether, wherein R_(d) is a counter ion (e.g., Na⁺ and the like) and provided that: i) R_(a) is S; ii) R_(b) is S— or SR_(d); or iii) R_(a) is S and R_(b) is S— or SR_(d). Unless stated otherwise specifically in the specification, a thiophosphoalkylether group is optionally substituted. For example, in certain embodiments, the —Oalkylether moiety in a thiophosphoalkyl group is optionally substituted with one or more of hydroxyl, amino, sulfhydryl, phosphate, thiophosphate, phosphoalkyl, thiophosphoalkyl, phosphoalkylether, thiophosphoalkylether or —OP(═R_(a))(R_(b))R_(c), wherein each of R_(a), R_(b) and R_(c) is as defined for compounds of structure (I).

“Carbocyclic” refers to a stable 3- to 18-membered aromatic or non-aromatic ring comprising 3 to 18 carbon atoms. Unless stated otherwise specifically in the specification, a carbocyclic ring may be a monocyclic, bicyclic, tricyclic or tetracyclic ring system, which may include fused or bridged ring systems, and may be partially or fully saturated. Non-aromatic carbocyclyl radicals include cycloalkyl, while aromatic carbocyclyl radicals include aryl. Unless stated otherwise specifically in the specification, a carbocyclic group is optionally substituted.

“Cycloalkyl” refers to a stable non-aromatic monocyclic or polycyclic carbocyclic ring, which may include fused or bridged ring systems, having from three to fifteen carbon atoms, preferably having from three to ten carbon atoms, and which is saturated or unsaturated and attached to the rest of the molecule by a single bond. Monocyclic cycloalkyls include, for example, cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, cycloheptyl, and cyclooctyl. Polycyclic cycloalkyls include, for example, adamantyl, norbornyl, decalinyl, 7,7-dimethyl-bicyclo-[2.2.1]heptanyl, and the like. Unless stated otherwise specifically in the specification, a cycloalkyl group is optionally substituted.

“Aryl” refers to a ring system comprising at least one carbocyclic aromatic ring. In some embodiments, an aryl comprises from 6 to 18 carbon atoms. The aryl ring may be a monocyclic, bicyclic, tricyclic or tetracyclic ring system, which may include fused or bridged ring systems. Aryls include, but are not limited to, aryls derived from aceanthrylene, acenaphthylene, acephenanthrylene, anthracene, azulene, benzene, chrysene, fluoranthene, fluorene, as-indacene, s-indacene, indane, indene, naphthalene, phenalene, phenanthrene, pleiadene, pyrene, and triphenylene. Unless stated otherwise specifically in the specification, an aryl group is optionally substituted.

“Heterocyclic” refers to a stable 3- to 18-membered aromatic or non-aromatic ring comprising one to twelve carbon atoms and from one to six heteroatoms selected from the group consisting of nitrogen, oxygen and sulfur. Unless stated otherwise specifically in the specification, the heterocyclic ring may be a monocyclic, bicyclic, tricyclic or tetracyclic ring system, which may include fused or bridged ring systems; and the nitrogen, carbon or sulfur atoms in the heterocyclic ring may be optionally oxidized; the nitrogen atom may be optionally quaternized; and the heterocyclic ring may be partially or fully saturated. Examples of aromatic heterocyclic rings are listed below in the definition of heteroaryls (i.e., heteroaryl being a subset of heterocyclic). Examples of non-aromatic heterocyclic rings include, but are not limited to, dioxolanyl, thienyl[1,3]dithianyl, decahydroisoquinolyl, imidazolinyl, imidazolidinyl, isothiazolidinyl, isoxazolidinyl, morpholinyl, octahydroindolyl, octahydroisoindolyl, 2-oxopiperazinyl, 2-oxopiperidinyl, 2-oxopyrrolidinyl, oxazolidinyl, piperidinyl, piperazinyl, 4-piperidonyl, pyrrolidinyl, pyrazolidinyl, pyrazolopyrimidinyl, quinuclidinyl, thiazolidinyl, tetrahydrofuryl, trioxanyl, trithianyl, triazinanyl, tetrahydropyranyl, thiomorpholinyl, thiamorpholinyl, 1-oxo-thiomorpholinyl, and 1,1-dioxo-thiomorpholinyl. Unless stated otherwise specifically in the specification, a heterocyclic group is optionally substituted.

“Heteroaryl” refers to a 5- to 14-membered ring system comprising one to thirteen carbon atoms, one to six heteroatoms selected from the group consisting of nitrogen, oxygen and sulfur, and at least one aromatic ring. For purposes of certain embodiments of this disclosure, the heteroaryl radical may be a monocyclic, bicyclic, tricyclic or tetracyclic ring system, which may include fused or bridged ring systems; and the nitrogen, carbon or sulfur atoms in the heteroaryl radical may be optionally oxidized; the nitrogen atom may be optionally quaternized. Examples include, but are not limited to, azepinyl, acridinyl, benzimidazolyl, benzthiazolyl, benzindolyl, benzodioxolyl, benzofuranyl, benzooxazolyl, benzothiazolyl, benzothiadiazolyl, benzo[b][1,4]dioxepinyl, 1,4-benzodioxanyl, benzonaphthofuranyl, benzoxazolyl, benzodioxolyl, benzodioxinyl, benzopyranyl, benzopyranonyl, benzofuranyl, benzofuranonyl, benzothienyl (benzothiophenyl), benzotriazolyl, benzo[4,6]imidazo[1,2-a]pyridinyl, benzoxazolinonyl, benzimidazolthionyl, carbazolyl, cinnolinyl, dibenzofuranyl, dibenzothiophenyl, furanyl, furanonyl, isothiazolyl, imidazolyl, indazolyl, indolyl, indazolyl, isoindolyl, indolinyl, isoindolinyl, isoquinolyl, indolizinyl, isoxazolyl, naphthyridinyl, oxadiazolyl, 2-oxoazepinyl, oxazolyl, oxiranyl, 1-oxidopyridinyl, 1-oxidopyrimidinyl, 1-oxidopyrazinyl, 1-oxidopyridazinyl, 1-phenyl-1H-pyrrolyl, phenazinyl, phenothiazinyl, phenoxazinyl, phthalazinyl, pteridinyl, pteridinonyl, purinyl, pyrrolyl, pyrazolyl, pyridinyl, pyridinonyl, pyrazinyl, pyrimidinyl, pryrimidinonyl, pyridazinyl, pyrrolyl, pyrido[2,3-d]pyrimidinonyl, quinazolinyl, quinazolinonyl, quinoxalinyl, quinoxalinonyl, quinolinyl, isoquinolinyl, tetrahydroquinolinyl, thiazolyl, thiadiazolyl, thieno[3,2-d]pyrimidin-4-onyl, thieno[2,3-d]pyrimidin-4-onyl, triazolyl, tetrazolyl, triazinyl, and thiophenyl (i.e. thienyl). Unless stated otherwise specifically in the specification, a heteroaryl group is optionally substituted.

The suffix “-ene” refers to a particular structural feature (e.g., alkyl, aryl, heteroalkyl, heteroaryl) attached to the rest of the molecule through a single bond and attached to a radical group through a single bond. In other words, the suffix “-ene” refers to a linker having the structural features of the moiety to which it is attached. The points of attachment of the “-ene” chain to the rest of the molecule and to the radical group can be through one atom of or any two atoms within the chain. For example, a heteroarylene refers to a linker comprising a heteroaryl moiety as defined herein.

“Fused” refers to a ring system comprising at least two rings, wherein the two rings share at least one common ring atom, for example two common ring atoms. When the fused ring is a heterocyclyl ring or a heteroaryl ring, the common ring atom(s) may be carbon or nitrogen. Fused rings include bicyclic, tricyclic, tertracyclic, and the like.

The term “substituted” used herein means any of the above groups (e.g., alkyl, alkylene, alkenylene, alkynylene, heteroalkylene, heteroalkenylene, heteroalkynylene, alkoxy, alkylether, phosphoalkyl, phosphoalkylether, thiophosphoalkyl, thiophosphoalkylether, carbocyclic, cycloalkyl, aryl, heterocyclic and/or heteroaryl) wherein at least one hydrogen atom (e.g., 1, 2, 3 or all hydrogen atoms) is replaced by a bond to a non-hydrogen atoms such as, but not limited to: a halogen atom such as F, Cl, Br, and I; an oxygen atom in groups such as hydroxyl groups, alkoxy groups, and ester groups; a sulfur atom in groups such as thiol groups, thioalkyl groups, sulfone groups, sulfonyl groups, and sulfoxide groups; a nitrogen atom in groups such as amines, amides, alkylamines, dialkylamines, arylamines, alkylarylamines, diarylamines, N-oxides, imides, and enamines; a silicon atom in groups such as trialkylsilyl groups, dialkylarylsilyl groups, alkyldiarylsilyl groups, and triarylsilyl groups; and other heteroatoms in various other groups. “Substituted” also means any of the above groups in which one or more hydrogen atoms are replaced by a higher-order bond (e.g., a double- or triple-bond) to a heteroatom such as oxygen in oxo, carbonyl, carboxyl, and ester groups; and nitrogen in groups such as imines, oximes, hydrazones, and nitriles. For example, “substituted” includes any of the above groups in which one or more hydrogen atoms are replaced with —NR_(g)R_(h), —NR_(g)C(═O)R_(h), —NR_(g)C(═O)NR_(g)R_(h), —NR_(g)C(═O)OR_(h), —NR_(g)SO₂R_(h), —OC(═O)NR_(g)R_(h), —OR_(g), —SR_(g), —SOR_(g), —SO₂R_(g), —OSO₂R_(g), —SO₂OR_(g), ═NSO₂R_(g), and —SO₂NR_(g)R_(h). “Substituted also means any of the above groups in which one or more hydrogen atoms are replaced with —C(═O)R_(g), —C(═O)OR_(g), —C(═O)NR_(g)R_(h), —CH₂SO₂R_(g), —CH₂SO₂NR_(g)R_(h). In the foregoing, R_(g) and R_(h) are the same or different and independently hydrogen, alkyl, alkoxy, alkylamino, thioalkyl, aryl, aralkyl, cycloalkyl, cycloalkylalkyl, haloalkyl, heterocyclyl, N-heterocyclyl, heterocyclylalkyl, heteroaryl, N-heteroaryl and/or heteroarylalkyl. “Substituted” further means any of the above groups in which one or more hydrogen atoms are replaced by a bond to an amino, cyano, hydroxyl, imino, nitro, oxo, thioxo, halo, alkyl, alkoxy, alkylamino, thioalkyl, aryl, aralkyl, cycloalkyl, cycloalkylalkyl, haloalkyl, heterocyclyl, N-heterocyclyl, heterocyclylalkyl, heteroaryl, N-heteroaryl and/or heteroarylalkyl group. In addition, each of the foregoing substituents may also be optionally substituted with one or more of the above substituents.

“Conjugation” refers to the overlap of one p-orbital with another p-orbital across an intervening sigma bond. Conjugation may occur in cyclic or acyclic compounds. A “degree of conjugation” refers to the overlap of at least one p-orbital with another p-orbital across an intervening sigma bond. For example, 1, 3-butadine has one degree of conjugation, while benzene and other aromatic compounds typically have multiple degrees of conjugation. Fluorescent and colored compounds typically comprise at least one degree of conjugation.

“Fluorescent” refers to a molecule which is capable of absorbing light of a particular frequency and emitting light of a different frequency. Fluorescence is well-known to those of ordinary skill in the art.

“Colored” refers to a molecule which absorbs light within the colored spectrum (i.e., red, yellow, blue and the like).

A “linker” refers to a contiguous chain of at least one atom, such as carbon, oxygen, nitrogen, sulfur, phosphorous and combinations thereof, which connects a portion of a molecule to another portion of the same molecule or to a different molecule, moiety or solid support (e.g., microparticle). Linkers may connect the molecule via a covalent bond or other means, such as ionic or hydrogen bond interactions.

The term “biomolecule” refers to any of a variety of biological materials, including nucleic acids, carbohydrates, amino acids, polypeptides, glycoproteins, hormones, aptamers and mixtures thereof

ore specifically, the term is intended to include, without limitation, RNA,

DNA, oligonucleotides, modified or derivatized nucleotides, enzymes, receptors, prions, receptor ligands (including hormones), antibodies, antigens, and toxins, as well as bacteria, viruses, blood cells, and tissue cells. The visually detectable biomolecules of the disclosure (e.g., compounds of structure (I) having a biomolecule linked thereto) are prepared, as further described herein, by contacting a biomolecule with a compound having a reactive group that enables attachment of the biomolecule to the compound via any available atom or functional group, such as an amino, hydroxy, carboxyl, or sulfhydryl group on the biomolecule.

A “reactive group” is a moiety capable of reacting with a second reactive groups (e.g., a “complementary reactive group”) to form one or more covalent bonds, for example by a displacement, oxidation, reduction, addition or cycloaddition reaction. Exemplary reactive groups are provided in Table 1, and include for example, nucleophiles, electrophiles, dienes, dienophiles, aldehyde, oxime, hydrazone, alkyne, amine, azide, acylazide, acylhalide, nitrile, nitrone, sulfhydryl, disulfide, sulfonyl halide, isothiocyanate, imidoester, activated ester, ketone, α,β-unsaturated carbonyl, alkene, maleimide, α-haloimide, epoxide, aziridine, tetrazine, tetrazole, phosphine, biotin, thiirane and the like.

“Bio-conjugation” or “bio-conjugate” and related variations refer to a chemical reaction strategy for forming a stable covalent bond between two molecules. The term “bio-conjugation” is generally used when one of the molecules is a biomolecule (e.g., an antibody), but can be used to describe forming a covalent bond with a non-biomolecule (e.g., a polymeric resin). The product or compound resulting from such a reaction strategy is a “conjugate,” “bio-conjugate” or a grammatical equivalent.

The terms “visible” and “visually detectable” are used herein to refer to substances that are observable by visual inspection, without prior illumination, or chemical or enzymatic activation. Such visually detectable substances absorb and emit light in a region of the spectrum ranging from about 300 to about 900 nm. Preferably, such substances are intensely colored, preferably having a molar extinction coefficient of at least about 40,000, more preferably at least about 50,000, still more preferably at least about 60,000, yet still more preferably at least about 70,000, and most preferably at least about 80,000 M⁻¹cm⁻¹. The compounds of the disclosure may be detected by observation with the naked eye, or with the aid of an optically based detection device, including, without limitation, absorption spectrophotometers, transmission light microscopes, digital cameras and scanners. Visually detectable substances are not limited to those which emit and/or absorb light in the visible spectrum. Substances which emit and/or absorb light in the ultraviolet (UV) region (about 10 nm to about 400 nm), infrared (IR) region (about 700 nm to about 1 mm), and substances emitting and/or absorbing in other regions of the electromagnetic spectrum are also included with the scope of “visually detectable” substances.

For purposes of embodiments of the disclosure, the term “photostable visible dye” refers to a chemical moiety that is visually detectable, as defined hereinabove, and is not significantly altered or decomposed upon exposure to light. Preferably, the photostable visible dye does not exhibit significant bleaching or decomposition after being exposed to light for at least one hour. More preferably, the visible dye is stable after exposure to light for at least 12 hours, still more preferably at least 24 hours, still yet more preferably at least one week, and most preferably at least one month. Non-limiting examples of photostable visible dyes suitable for use in the compounds and methods of the disclosure include azo dyes, thioindigo dyes, quinacridone pigments, dioxazine, phthalocyanine, perinone, diketopyrrolopyrrole, quinophthalone, and truarycarbonium.

As used herein, the term “perylene derivative” is intended to include any substituted perylene that is visually detectable. However, the term is not intended to include perylene itself. The terms “anthracene derivative”, “naphthalene derivative”, and “pyrene derivative” are used analogously. In some preferred embodiments, a derivative (e.g., perylene, pyrene, anthracene or naphthalene derivative) is an imide, bisimide or hydrazamimide derivative of perylene, anthracene, naphthalene, or pyrene.

The visually detectable molecules of various embodiments of the disclosure are useful for a wide variety of analytical applications, such as biochemical and biomedical applications, in which there is a need to determine the presence, location, or quantity of a particular analyte (e.g., biomolecule). In another aspect, therefore, the disclosure provides a method for visually detecting a biomolecule, comprising: (a) providing a biological system with a visually detectable biomolecule comprising the compound of structure (I) linked to a biomolecule; and (b) detecting the biomolecule by its visible properties. For purposes of the disclosure, the phrase “detecting the biomolecule by its visible properties” means that the biomolecule, without illumination or chemical or enzymatic activation, is observed with the naked eye, or with the aid of a optically based detection device, including, without limitation, absorption spectrophotometers, transmission light microscopes, digital cameras and scanners. A densitometer may be used to quantify the amount of visually detectable biomolecule present. For example, the relative quantity of the biomolecule in two samples can be determined by measuring relative optical density. If the stoichiometry of dye molecules per biomolecule is known, and the extinction coefficient of the dye molecule is known, then the absolute concentration of the biomolecule can also be determined from a measurement of optical density. As used herein, the term “biological system” is used to refer to any solution or mixture comprising one or more biomolecules in addition to the visually detectable biomolecule. Nonlimiting examples of such biological systems include cells, cell extracts, tissue samples, electrophoretic gels, assay mixtures, and hybridization reaction mixtures.

“Solid support” or “solid support residue” refers to any solid substrate known in the art for solid-phase support of molecules, for example a “microparticle” refers to any of a number of small particles useful for attachment to compounds of the disclosure, including, but not limited to, glass beads, magnetic beads, polymeric beads, non-polymeric beads, and the like. In certain embodiments, a microparticle comprises polystyrene beads.

A “targeting moiety” is a moiety that selectively binds or associates with a particular target, such as an analyte molecule. “Selectively” binding or associating means a targeting moiety preferentially associates or binds with the desired target relative to other targets. In some embodiments the compounds disclosed herein include linkages to targeting moieties for the purpose of selectively binding or associating the compound with an analyte of interest (i.e., the target of the targeting moiety), thus allowing detection of the analyte. Exemplary targeting moieties include, but are not limited to, antibodies, antigens, nucleic acid sequences, enzymes, proteins, cell surface receptor antagonists, and the like. In some embodiments, the targeting moiety is a moiety, such as an antibody, that selectively binds or associates with a target feature on or in a cell, for example a target feature on a cell membrane or other cellular structure, thus allowing for detection of cells of interest. Small molecules that selectively bind or associate with a desired analyte are also contemplated as targeting moieties in certain embodiments. One of skill in the art will understand other analytes, and the corresponding targeting moiety, that will be useful in various embodiments.

“Base pairing moiety” refers to a heterocyclic moiety capable of hybridizing with a complementary heterocyclic moiety via hydrogen bonds (e.g., Watson-Crick base pairing). Base pairing moieties include natural and unnatural bases. Non-limiting examples of base pairing moieties are RNA and DNA bases such adenosine, guanosine, thymidine, cytosine and uridine and analogues thereof.

Embodiments of the disclosure disclosed herein are also meant to encompass all compounds being isotopically-labelled by having one or more atoms replaced by an atom having a different atomic mass or mass number. Examples of isotopes that can be incorporated into the disclosed compounds include isotopes of hydrogen, carbon, nitrogen, oxygen, phosphorous, fluorine, chlorine, and iodine, such as ²H, ³H, ¹¹C, ¹³C, ¹⁴C, ¹³N, ¹⁵N, ¹⁵O, ¹⁷O, ¹⁸O, ³¹P, ³²P, ³⁵S, ¹⁸F, ³⁶Cl, ¹²³I and ¹²⁵I respectively.

Isotopically-labeled compounds of structure (I) can generally be prepared by conventional techniques known to those skilled in the art or by processes analogous to those described below and in the following Examples using an appropriate isotopically-labeled reagent in place of the non-labeled reagent previously employed.

“Stable compound” and “stable structure” are meant to indicate a compound that is sufficiently robust to survive isolation to a useful degree of purity from a reaction mixture, and formulation into an efficacious therapeutic agent.

“Optional” or “optionally” means that the subsequently described event or circumstances may or may not occur, and that the description includes instances where said event or circumstance occurs and instances in which it does not. For example, “optionally substituted alkyl” means that the alkyl group may or may not be substituted and that the description includes both substituted alkyl groups and alkyl groups having no substitution.

“Salt” includes both acid and base addition salts.

“Acid addition salt” refers to those salts which are formed with inorganic acids such as, but not limited to, hydrochloric acid, hydrobromic acid, sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as, but not limited to, acetic acid, 2,2-dichloroacetic acid, adipic acid, alginic acid, ascorbic acid, aspartic acid, benzenesulfonic acid, benzoic acid, 4-acetamidobenzoic acid, camphoric acid, camphor-10-sulfonic acid, capric acid, caproic acid, caprylic acid, carbonic acid, cinnamic acid, citric acid, cyclamic acid, dodecylsulfuric acid, ethane-1,2-disulfonic acid, ethanesulfonic acid, 2-hydroxyethanesulfonic acid, formic acid, fumaric acid, galactaric acid, gentisic acid, glucoheptonic acid, gluconic acid, glucuronic acid, glutamic acid, glutaric acid, 2-oxo-glutaric acid, glycerophosphoric acid, glycolic acid, hippuric acid, isobutyric acid, lactic acid, lactobionic acid, lauric acid, maleic acid, malic acid, malonic acid, mandelic acid, methanesulfonic acid, mucic acid, naphthalene-1,5-disulfonic acid, naphthalene-2-sulfonic acid, 1-hydroxy-2-naphthoic acid, nicotinic acid, oleic acid, orotic acid, oxalic acid, palmitic acid, pamoic acid, propionic acid, pyroglutamic acid, pyruvic acid, salicylic acid, 4-aminosalicylic acid, sebacic acid, stearic acid, succinic acid, tartaric acid, thiocyanic acid, p-toluenesulfonic acid, trifluoroacetic acid, undecylenic acid, and the like.

“Base addition salt” refers to those salts which are prepared from addition of an inorganic base or an organic base to the free acid. Salts derived from inorganic bases include, but are not limited to, sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper, manganese, aluminum salts and the like. Salts derived from organic bases include, but are not limited to, salts of primary, secondary, and tertiary amines, substituted amines including naturally occurring substituted amines, cyclic amines and basic ion exchange resins, such as ammonia, isopropylamine, trimethylamine, diethylamine, triethylamine, tripropylamine, diethanolamine, ethanolamine, deanol, 2-dimethylaminoethanol, 2-diethylaminoethanol, dicyclohexylamine, lysine, arginine, histidine, caffeine, procaine, hydrabamine, choline, betaine, benethamine, benzathine, ethylenediamine, glucosamine, methylglucamine, theobromine, triethanolamine, tromethamine, purines, piperazine, piperidine, N-ethylpiperidine, polyamine resins and the like. Particularly preferred organic bases are isopropylamine, diethylamine, ethanolamine, trimethylamine, dicyclohexylamine, choline and caffeine.

Crystallizations may produce a solvate of the compounds described herein. Embodiments of the present disclosure include all solvates of the described compounds. As used herein, the term “solvate” refers to an aggregate that comprises one or more molecules of a compound of the disclosure with one or more molecules of solvent. The solvent may be water, in which case the solvate may be a hydrate. Alternatively, the solvent may be an organic solvent. Thus, the compounds of the present disclosure may exist as a hydrate, including a monohydrate, dihydrate, hemihydrate, sesquihydrate, trihydrate, tetrahydrate and the like, as well as the corresponding solvated forms. The compounds of the disclosure may be true solvates, while in other cases the compounds of the disclosure may merely retain adventitious water or another solvent or be a mixture of water plus some adventitious solvent.

Embodiments of the compounds of the disclosure (e.g., compounds of structure I), or their salts, tautomers or solvates may contain one or more stereocenters and may thus give rise to enantiomers, diastereomers, and other stereoisomeric forms that may be defined, in terms of absolute stereochemistry, as (R)- or (S)- or, as (D)- or (L)- for amino acids. Embodiments of the present disclosure are meant to include all such possible isomers, as well as their racemic and optically pure forms. Optically active (+) and (−), (R)- and (S)-, or (D)- and (L)-isomers may be prepared using chiral synthons or chiral reagents, or resolved using conventional techniques, for example, chromatography and fractional crystallization. Conventional techniques for the preparation/isolation of individual enantiomers include chiral synthesis from a suitable optically pure precursor or resolution of the racemate (or the racemate of a salt or derivative) using, for example, chiral high pressure liquid chromatography (HPLC). When the compounds described herein contain olefinic double bonds or other centers of geometric asymmetry, and unless specified otherwise, it is intended that the compounds include both E and Z geometric isomers. Likewise, all tautomeric forms are also intended to be included.

A “stereoisomer” refers to a compound made up of the same atoms bonded by the same bonds but having different three-dimensional structures, which are not interchangeable. The present disclosure contemplates various stereoisomers and mixtures thereof and includes “enantiomers”, which refers to two stereoisomers whose molecules are non-superimposable mirror images of one another.

A “tautomer” refers to a proton shift from one atom of a molecule to another atom of the same molecule. The present disclosure includes tautomers of any said compounds. Various tautomeric forms of the compounds are easily derivable by those of ordinary skill in the art.

The chemical naming protocol and structure diagrams used herein are a modified form of the I.U.P.A.C. nomenclature system, using the ACD/Name Version 9.07 software program and/or ChemDraw Ultra Version 11.0 software naming program (CambridgeSoft). Common names familiar to one of ordinary skill in the art are also used.

As noted above, in one embodiment of the present disclosure, compounds useful as fluorescent and/or colored dyes in various analytical methods are provided. In other embodiments, compounds useful as synthetic intermediates for preparation of compounds useful as fluorescent and/or colored dyes are provided. In general terms, embodiments of the present disclosure are directed to dimers and higher polymers of fluorescent and/or colored moieties. The fluorescent and or colored moieties are linked by a linking moiety. Without wishing to be bound by theory, it is believed the linker helps to maintain sufficient spatial distance between the fluorescent and/or colored moieties such that intramolecular quenching is reduced or eliminated, thus resulting in a dye compound having a high molar “brightness” (e.g., high fluorescence emission).

Accordingly, in some embodiments the compounds have the following structure (A):

wherein L is a linker (e.g., heteroalkylene) sufficient to maintain spatial separation between one or more (e.g., each) M¹ group so that intramolecular quenching is reduced or eliminated, and R¹, R², L^(1a), L^(1b), L², L³ and n are as defined for structure (I). In some embodiments of structure (A), L is a linker comprising one or more ethylene glycol or polyethylene glycol moieties.

In other embodiments is provided a compound having the following structure (I):

or a stereoisomer, salt or tautomer thereof, wherein:

M¹ and M² are, at each occurrence, independently a moiety comprising a chromophore;

L^(1a) is, at each occurrence, independently a heteroarylene linker;

L^(1b), L², L³, L⁵, L⁶ and L⁷ are, at each occurrence, independently optional alkylene, alkenylene, alkynylene, heteroalkylene, heteroalkenylene or heteroalkynylene linkers;

L⁴ is, at each occurrence, independently an alkylene, alkenylene, alkynylene, heteroalkylene, heteroalkenylene or heteroalkynylene linker;

R¹ and R² are each independently H, OH, SH, alkyl, alkoxy, alkylether, heteroalkyl, —OP(═R_(a))(R_(b))R_(c), Q, or a protected form thereof, or L′;

R³ is, at each occurrence, independently H, alkyl or alkoxy;

R⁴ is, at each occurrence, independently OH, SH, O⁻, S⁻, OR_(d) or SR_(d);

R⁵ is, at each occurrence, independently oxo, thioxo or absent;

R_(a) is O or S;

R_(b) is OH, SH, O⁻, S⁻, OR_(d) or SR_(d);

R_(c) is OH, SH, O⁻, S⁻, OR_(d), OL′, SR_(d), alkyl, alkoxy, heteroalkyl, heteroalkoxy, alkylether, alkoxyalkylether, phosphate, thiophosphate, phosphoalkyl, thiophosphoalkyl, phosphoalkylether or thiophosphoalkylether;

R_(d) is a counter ion;

Q is, at each occurrence, independently a moiety comprising a reactive group, or protected form thereof, capable of forming a covalent bond with an analyte molecule, a targeting moiety, a solid support or a complementary reactive group Q′;

L′ is, at each occurrence, independently a linker comprising a covalent bond to Q, a linker comprising a covalent bond to a targeting moiety, a linker comprising a covalent bond to an analyte molecule, a linker comprising a covalent bond to a solid support, a linker comprising a covalent bond to a solid support residue, a linker comprising a covalent bond to a nucleoside or a linker comprising a covalent bond to a further compound of structure (I);

m is, at each occurrence, an integer of one or greater;

n is an integer of one or greater; and

q and w are, at each occurrence, independently 0 or 1, provided at least one occurrence of w is 1.

The various linkers and substituents (e.g., M¹, M², Q, R¹, R², R³, R_(c), L^(1a), L^(1b), L², L³, L⁴, L⁵, L⁶ and L⁷) in the compound of structure (I) are optionally substituted with one more substituent. For example, in some embodiments the optional substituent is selected to optimize the water solubility or other property of the compound of structure (I). In certain embodiments, each chromophore, alkyl, alkoxy, alkylether, heteroarylene, heteroalkyl, alkylene, alkenylene, alkynylene, heteroalkylene, heteroalkenylene, heteroalkynylene, alkoxyalkylether, phosphoalkyl, thiophosphoalkyl, phosphoalkylether and thiophosphoalkylether in the compound of structure (I) is optionally substituted with one more substituent selected from the group consisting of hydroxyl, alkoxy, alkylether, alkoxyalkylether, sulfhydryl, amino, alkylamino, carboxyl, phosphate, thiophosphate, phosphoalkyl, thiophosphoalkyl, phosphoalkylether and thiophosphoalkylether. In certain embodiments the optional substituent is —OP(═Ra)(Rb)Rc, where Ra, Rb and Rc are as defined for the compound of structure (I).

In some embodiments, at least one occurrence of L^(1a) is an optionally substituted 5-7 membered heteroarylene linker. In some more specific embodiments, L^(1a) is, at each occurrence independently an optionally substituted 5-7 membered heteroarylene linker. In some embodiments, L^(1a) is a 6-membered heteroarylene. In some embodiments, L^(1a) comprises two N atoms and two O atoms. In certain embodiments, L^(1a) is, at each occurrence, substituted. In some related embodiments, L^(1b) is substituted, for example, L^(1b) is substituted with oxo, alkyl (e.g., methyl, ethyl, etc.) or combinations thereof. In more specific embodiments, L^(1a) is, at each occurrence, substituted with at least one oxo. In some embodiments, L^(1a) has one of the following structures:

In some embodiments, L^(1b) is, at each occurrence, independently an optional alkylene, alkenylene, alkynylene, heteroalkylene, heteroalkenylene, heteroalkynylene, alkyleneheteroarylenealkylene, alkyleneheterocyclylenealkylene, alkylenecarbocyclylenealkylene, heteroalkyleneheteroarylenealkylene, heteroalkyleneheterocyclylenealkylene, heteroalkylenecarbocyclylenealkylene, heteroalkyleneheteroaryleneheteroalkylene, heteroalkyleneheterocyclyleneheteroalkylene, heteroalkylenecarbocyclyleneheteroalkylene, alkyleneheteroaryleneheteroalkylene, alkyleneheterocyclyleneheteroalkylene, alkylenecarbocyclyleneheteroalkylene, heteroarylene, heterocyclylene, carbocyclylene, alkyleneheteroarylene, alkyleneheterocyclylene, heteroarylenealkylene, alkylenecarbocyclylene, carbocyclylenealkylene, heteroalkyleneheteroarylene, heteroalkyleneheterocyclylene, heteroaryleneheteroalkylene, heteroalkylenecarbocyclylene, carbocyclyleneheteroalkylene or heteroatomic linker. In some embodiments, L^(1b) is an optionally substituted heteroalkenylene linker.

In some embodiments, at least one occurrence of L^(1b) is substituted. In certain embodiments, L^(1b) is substituted at each occurrence. In some more specific embodiments, L^(1b) is substituted with oxo.

In other embodiments, L^(1b) is at each occurrence, independently a linker comprising a functional group capable of formation by reaction of two complementary reactive groups (e.g., triazolyl, amide, etc.), for example a Q group.

The optional linkers L^(1b) and L⁷ can be used as a point of attachment of the M¹ and M² moieties to the remainder of the compound. For example, in some embodiments a synthetic precursor to the compound of structure (I) is prepared, and the M¹ and M² moieties are attached to the synthetic precursor using any number of facile methods known in the art, for example methods referred to as “click chemistry.” For this purpose any reaction which is rapid and substantially irreversible can be used to attach M¹ and M² to the synthetic precursor to form a compound of structure (I). Exemplary reactions include the copper catalyzed reaction of an azide and alkyne to form a triazole (Huisgen 1, 3-dipolar cycloaddition), reaction of a diene and dienophile (Diels-Alder), strain-promoted alkyne-nitrone cycloaddition, strain-promoted cycloalkyne-azide cycloaddition (Cu-free click), reaction of a strained alkene with an azide, tetrazine or tetrazole, alkene and azide [3+2] cycloaddition, alkene and tetrazine inverse-demand Diels-Alder, alkene and tetrazole photoreaction and various displacement reactions, such as displacement of a leaving group by nucleophilic attack on an electrophilic atom. Exemplary displacement reactions include reaction of an amine with: an activated ester; an N-hydroxysuccinimide ester; an isocyanate; an isothioscyanate or the like. In some embodiments the reaction to form L^(1b) or L⁷ may be performed in an aqueous environment.

Accordingly, in some embodiments L^(1b) or L⁷ are at each occurrence, independently a linker comprising a functional group capable of formation by reaction of two complementary reactive groups, for example a functional group which is the product of one of the foregoing “click” reactions. In various embodiments, for at least one occurrence of L^(1b) or L⁷, the functional group can be formed by reaction of an aldehyde, oxime, hydrazone, alkyne, amine, azide, acylazide, acylhalide, nitrile, nitrone, sulfhydryl, disulfide, sulfonyl halide, isothiocyanate, imidoester, activated ester (e.g., N-hydroxysuccinimide ester), ketone, α,β-unsaturated carbonyl, alkene, maleimide, α-haloimide, epoxide, aziridine, tetrazine, tetrazole, phosphine, biotin or thiirane functional group with a complementary reactive group, for example, via a reaction of an amine with an N-hydroxysuccinimide ester or isothiocyanate.

In other embodiments, for at least one occurrence of L^(1b) or L⁷, the functional group can be formed by reaction of an alkyne and an azide. In other embodiments, for at least one occurrence of L^(1b) or L⁷, the functional group can be formed by reaction of an amine (e.g., primary amine) and an N-hydroxysuccinimide ester or isothiocyanate.

In more embodiments, for at least one occurrence of L^(1b) or L⁷, the functional group comprises an alkene, ester, amide, thioester, disulfide, carbocyclic, heterocyclic or heteroaryl group. In more embodiments, for at least one occurrence of L^(1b) or L⁷, the functional group comprises an alkene, ester, amide, thioester, thiourea, disulfide, carbocyclic, heterocyclic or heteroaryl group. In other embodiments, the functional group comprises an amide or thiourea. In some more specific embodiments, for at least one occurrence of L^(1b) or L⁷, L^(1b) or L⁷ are linkers comprising a triazolyl functional group. In some related embodiments, L^(1b) or L⁷, at each occurrence, independently comprises a triazolyl functional group. While in other embodiments, for at least one occurrence of L^(1b) or L⁷ is a linker comprising an amide or thiourea functional group.

In still other embodiments, for at least one occurrence of L^(1b), L^(1b)-M¹ has the following structure:

wherein L^(1c) and L^(1d) are each independently optional linkers.

In different embodiments, for at least one occurrence of L^(1b), L^(1b)-M¹ has the following structure:

wherein L^(1c) and L^(1d) are each independently optional linkers.

In various embodiments of the foregoing, L^(1c) or L^(1d), or both, is absent. In other embodiments, L^(1c) or L^(1d), or both, is present.

In some embodiments L^(1c) and L^(1d), when present, are each independently alkylene or heteroalkylene. For example, in some embodiments L^(1c) and L^(1d), when present, independently have one of the following structures:

In still other embodiments, for at least one occurrence of L⁷, L⁷-M² has the following structure:

wherein L^(1e) and L^(1f) are each independently optional linkers.

In different embodiments, for at least one occurrence of L⁷, L⁷-M² has the following structure:

wherein L^(1e) and L^(1f) are each independently optional linkers.

In various embodiments of the foregoing, L^(1e) or L^(1f), or both, is absent.

In other embodiments, L^(1e) or L^(1f), or both, is present.

In some embodiments L^(1e) and L^(1f), when present, are each independently alkylene or heteroalkylene. For example, in some embodiments L^(1e) and L^(1f), when present, independently have one of the following structures:

In some embodiments, at least one occurrence of L^(1b) has one of the following structures:

wherein

a, b, and c are each independently an integer ranging from 1-6.

In some embodiments, each occurrence of L^(1b) has one of the following structures:

wherein

a, b, and c are each independently an integer ranging from 1-6.

In some embodiments, at least one occurrence of L^(1b) has one of the following structures:

In still other different embodiments of structure (I), L^(1b) is at each occurrence, independently an optional alkylene or heteroalkylene linker. In certain embodiments, L^(1b) has one of the following structures:

In still other different embodiments of structure (I), L⁷ is at each occurrence, independently an optional alkylene or heteroalkylene linker. In certain embodiments, L⁷ has one of the following structures:

In some embodiments, at least one occurrence of L³ is an alkylene linker. In more specific embodiments, L³ or is an alkylene linker at each occurrence. In certain embodiments, the alkylene linker is a methylene linker.

In some embodiments, at least one occurrence of L² is absent. In more specific embodiments, L² is absent at each occurrence.

In certain embodiments, at least one occurrence of L⁵ or L⁶ is a heteroalkylene linker. In some more specific embodiments, L⁵ or L⁶ is a heteroalkylene linker at each occurrence. In some embodiments, at least one occurrence of L⁴ comprises alkylene oxide. In some embodiments, at least one occurrence of L⁵ or L⁶ comprises alkylene oxide. In some of the foregoing embodiments, the alkylene oxide is ethylene oxide, for example, polyethylene oxide. In certain embodiments, at least one occurrence of L⁵ or L⁶ is an alkylene linker (e.g., methylene). In some more specific embodiments, L⁵ or L⁶ is an alkylene linker at each occurrence (e.g., methylene).

In certain embodiments, at least one occurrence of L⁵ is a heteroalkylene linker. In some more specific embodiments, L⁵ is a heteroalkylene linker at each occurrence. In some embodiments, at least one occurrence of L⁵ comprises alkylene oxide, for example, ethylene oxide (e.g., polyethylene oxide). In certain embodiments, at least one occurrence of L⁵ is an alkylene linker (e.g., methylene). In some more specific embodiments, L⁵ is an alkylene linker at each occurrence (e.g., methylene). In certain embodiments, at least one occurrence of L⁵ is absent. In some more specific embodiments, L⁵ is absent at each occurrence.

In certain embodiments, at least one occurrence of L⁶ is a heteroalkylene linker. In some more specific embodiments, L⁶ is a heteroalkylene linker at each occurrence. In some embodiments, at least one occurrence of L⁶ comprises alkylene oxide. In some of the foregoing embodiments, the alkylene oxide is ethylene oxide, for example, polyethylene oxide. In certain embodiments, at least one occurrence of L⁶ is an alkylene linker (e.g., methylene). In some more specific embodiments, L⁶ is an alkylene linker at each occurrence (e.g., methylene). In certain embodiments, at least one occurrence of L⁶ is absent. In some more specific embodiments, L⁶ is absent at each occurrence.

In certain embodiments, at least one occurrence of L⁵ or L⁶ comprises a phosphodiester moiety. In more specific embodiments, each occurrence of L⁵ or L⁶ comprises a phosphodiester moiety. In more embodiments, L², L³, L⁴ or L⁶ are, at each occurrence, independently C₁-C₆ alkylene, C₂-C₆ alkenylene or C₂-C₆ alkynylene.

In some embodiments, at least one occurrence of L⁵ is heteroalkylene. In some embodiments, L⁵ is heteroalkylene at each occurrence, for example, a heteroalkylene comprising one of the following structures:

In some embodiments, at least one occurrence of L⁶ is heteroalkylene. In some embodiments, L⁶ is heteroalkylene at each occurrence, for example, a heteroalkylene comprising one of the following structures:

In some of the foregoing embodiments, a heteroalkylene (e.g., L³, L⁴, L⁵ or L⁶) comprises the following structure:

wherein

z is an integer ranging from 19 to 30. In some embodiments, z ranges from 19-28. In certain embodiments, the average z is 23. In some embodiments, the average z is 19, 20, 21, 22, 23, 24, 25, 26, 27, or 28.

In some embodiments, at least one occurrence of R³ is H. In more specific embodiments, R³ is H at each occurrence.

In some embodiments, m is 0. In some of the foregoing embodiments, q is 0. In some related embodiments, the compound has the following structure (Ia):

In some other embodiments, the compound has one of the following structures (Ib) or (Ic):

wherein:

L^(1b) is, at each occurrence, independently an optionally substituted alkylene or an optionally substituted heteroalkylene linker.

In some embodiments, the compound has one of the following structures (Id) or (Ie):

wherein:

z is an integer from 1 to 100. In some embodiments, L^(1b), at each occurrence, independently comprises an amide functional group or a triazolyl functional group.

In still other embodiments of any of the compounds of structure (I), R⁵ is, at each occurrence, independently OH, O⁻ or OR_(d). It is understood that “OR_(d)” and “SR_(d)” are intended to refer to O⁻ and S⁻ associated with a cation. For example, the disodium salt of a phosphate group may be represented as:

where R_(d) is sodium (Na⁺).

In other embodiments of any of the compounds of structure (I), at least one occurrence of R⁴ is oxo. In other embodiments of any of the compounds of structure (I), R⁴ is, at each occurrence, oxo.

In other various embodiments, R¹ and R² are each independently OH or —OP(═R_(a))(R_(b))R_(c). In some different embodiments, R¹ or R² is OH or —OP(═R_(a))(R_(b))R_(c), and the other of R¹ or R² is Q or a linker comprising a covalent bond to Q.

In still more different embodiments of any of the foregoing compounds of structure (I), R¹ and R² are each independently —OP(═R_(a))(R_(b))R_(c). In some of these embodiments, R_(c) is OL′.

In other embodiments, R¹ and R² are each independently —OP(═R_(a))(R_(b))OL′, and L′ is an alkylene or heteroalkylene linker to: Q, a targeting moiety, an analyte (e.g., analyte molecule), a solid support, a solid support residue, a nucleoside or a further compound of structure (I).

The linker L′ can be any linker suitable for attaching Q, a targeting moiety, an analyte (e.g., analyte molecule), a solid support, a solid support residue, a nucleoside or a further compound of structure (I) to the compound of structure (I). Advantageously certain embodiments include use of L′ moieties selected to increase or optimize water solubility of the compound. In certain embodiments, L′ is a heteroalkylene moiety. In some other certain embodiments, L′ comprises an alkylene oxide or phosphodiester moiety, or combinations thereof.

In certain embodiments, L′ has the following structure:

wherein:

m″ and n″ are independently an integer from 1 to 10;

R^(e) is H, an electron pair or a counter ion;

L″ is R^(e) or a direct bond or linkage to: Q, a targeting moiety, an analyte (e.g., analyte molecule), a solid support, a solid support residue, a nucleoside or a further compound of structure (I).

In some embodiments, m″ is an integer from 4 to 10, for example 4, 6 or 10. In other embodiments n″ is an integer from 3 to 6, for example 3, 4, 5 or 6. In some embodiments, n″ is an integer from 18-28, for example, from 21-23.

In some other embodiments, L″ is an alkylene, alkyleneheterocyclylene, alkyleneheterocyclylenealkylene, alkylenecyclylene, alkylenecyclylenealkylene, heteroalkylene, heteroalkyleneheterocyclylene, heteroalkyleneheterocyclyleneheteroalkylene, heteroalkylenecyclylene, or heteroalkylenecycleneheteroalkylene moiety. In some other certain embodiments, L″ comprises an alkylene oxide, phosphodiester moiety, sulfhydryl, disulfide or maleimide moiety or combinations thereof.

In certain of the foregoing embodiments, the targeting moiety is an antibody or cell surface receptor antagonist.

In other more specific embodiments of any of the foregoing compounds of structure (I), R¹ or R² has one of the following structures:

In other more specific embodiments of any of the foregoing compounds of structure (I), R¹ or R² has one of the following structures:

Certain embodiments of compounds of structure (I) can be prepared according to solid-phase synthetic methods analogous to those known in the art for preparation of oligonucleotides. Accordingly, in some embodiments, L′ is a linkage to a solid support, a solid support residue or a nucleoside. Solid supports comprising an activated deoxythymidine (dT) group are readily available, and in some embodiments can be employed as starting material for preparation of compounds of structure (I). Accordingly, in some embodiments R¹ or R² has the following structure:

One of skill in the art will understand that the dT group depicted above is included for ease of synthesis and economic efficiencies only, and is not required. Other solid supports can be used and would result in a different nucleoside or solid support residue being present on L′, or the nucleoside or solid support residue can be removed or modified post synthesis.

In still other embodiments, Q is, at each occurrence, independently a moiety comprising a reactive group capable of forming a covalent bond with an analyte molecule or a solid support. In other embodiments, Q is, at each occurrence, independently a moiety comprising a reactive group capable of forming a covalent bond with a complementary reactive group Q′. For example, in some embodiments, Q′ is present on a further compound of structure (I) (e.g., in the R¹ or R² position), and Q and Q′ comprise complementary reactive groups such that reaction of the compound of structure (I) and the further compound of structure (I) results in covalently bound dimer of the compound of structure (I). Multimer compounds of structure (I) can also be prepared in an analogous manner and are included within the scope of embodiments of the disclosure.

The type of Q group and connectivity of the Q group to the remainder of the compound of structure (I) is not limited, provided that Q comprises a moiety having appropriate reactivity for forming the desired bond.

In certain embodiments, Q is a moiety which is not susceptible to hydrolysis under aqueous conditions, but is sufficiently reactive to form a bond with a corresponding group on an analyte molecule or solid support (e.g., an amine, azide or alkyne).

Certain embodiments of compounds of structure (I) comprise Q groups commonly employed in the field of bioconjugation. For example in some embodiments, Q comprises a nucleophilic reactive group, an electrophilic reactive group or a cycloaddition reactive group. In some more specific embodiments, Q comprises a sulfhydryl, disulfide, activated ester, isothiocyanate, azide, alkyne, alkene, diene, dienophile, acid halide, sulfonyl halide, phosphine, α-haloamide, biotin, amino or maleimide functional group. In some embodiments, the activated ester is an N-succinimide ester, imidoester or polyflourophenyl ester. In other embodiments, the alkyne is an alkyl azide or acyl azide.

The Q groups can be conveniently provided in protected form to increase storage stability or other desired properties, and then the protecting group removed at the appropriate time for conjugation with, for example, a targeting moiety or analyte. Accordingly, Q groups include “protected forms” of a reactive group, including any of the reactive groups described above and in the Table 1 below. A “protected form” of Q refers to a moiety having lower reactivity under predetermined reaction conditions relative to Q, but which can be converted to Q under conditions, which preferably do not degrade or react with other portions of the compound of structure (I). One of skill in the art can derive appropriate protected forms of Q based on the particular Q and desired end use and storage conditions. For example, when Q is SH, a protected form of Q includes a disulfide, which can be reduce to reveal the SH moiety using commonly known techniques and reagents.

Exemplary Q moieties are provided in Table I below.

TABLE 1 Exemplary Q Moieties Structure Class

Sulfhydryl

Isothiocyanate

Imidoester

Acyl Azide

Activated Ester

Activated Ester

Activated Ester

Activated Ester

Activated Ester

Activated Ester

Sulfonyl halide

Maleimide

Maleimide

Maleimide

α-haloimide

Disulfide

Phosphine

Azide

Alkyne

Biotin

Diene

Alkene/dienophile

Alkene/dienophile

Amino

It should be noted that in some embodiments, wherein Q is SH, the SH moiety will tend to form disulfide bonds with another sulfhydryl group, for example on another compound of structure (I). Accordingly, some embodiments include compounds of structure (I), which are in the form of disulfide dimers, the disulfide bond being derived from SH Q groups.

Also included within the scope of certain embodiments are compounds of structure (I), wherein one, or both, of R¹ and R² comprises a linkage to a further compound of structure (I). For example, wherein one or both of R¹ and R² are —OP(═R_(a))(R_(b))R_(c), and R_(c) is OL′, and L′ is a linker comprising a covalent bond to a further compound of structure (I). Such compounds can be prepared by preparing a first compound of structure (I) having for example about 10 “M¹” and/or “M²” moieties (i.e., n=10) and having an appropriate “Q” for reaction with a complementary Q′ group on a second compound of structure (I). In this manner, compounds of structure (I), having any number of “M¹” and/or “M²” moieties, for example 100 or more, can be prepared without the need for sequentially coupling each monomer. Exemplary embodiments of such compounds of structure (I) have the following structure (I′):

wherein:

each occurrence of R¹, R², R³, R⁴, R⁵, L^(1a), L^(1b), L², L³, L⁴, L₅, L⁶, L⁷, M¹, M¹, q, m, w and n are independently as defined for a compound of structure (I);

L″ is a linker comprising a functional group resulting from reaction of a Q moiety with a corresponding Q′ moiety; and

α is an integer greater than 1, for example from 1 to 100, or 1 to 10.

Compounds of structure (I′) are derivable by those of ordinary skill in the art, for example by dimerizing or polymerizing compounds of structure (I) provided herein.

In other embodiments, the Q moiety is conveniently masked (e.g., protected) as a disulfide moiety, which can later be reduced to provide an activated Q moiety for binding to a desired analyte molecule or targeting moiety. For example, the Q moiety may be masked as a disulfide having the following structure:

wherein R is an optionally substituted alkyl group. For example, in some embodiments, Q is provided as a disulfide moiety having the following structure:

where n is an integer from 1 to 10.

In some other embodiments, one of R¹ or R² is OH or —OP(═R_(a))(R_(b))R_(c), and the other of R¹ or R² is a linker comprising a covalent bond to an analyte molecule or a linker comprising a covalent bond to a solid support. For example, in some embodiments the analyte molecule is a nucleic acid, amino acid or a polymer thereof. In other embodiments, the analyte molecule is an enzyme, receptor, receptor ligand, antibody, glycoprotein, aptamer or prion. In some embodiments, the targeting moiety is an antibody or cell surface receptor antagonist. In still different embodiments, the solid support is a polymeric bead or non-polymeric bead.

The fluorescence intensity can also be tuned by selection of different values of n. In certain embodiments, n is an integer from 1 to 100. In other embodiments, n is an integer from 1 to 10. In some embodiments, n is 1. In some embodiments, n is 2. In some embodiments, n is 3. In some embodiments, n is 4. In some embodiments, n is 5. In some embodiments, n is 6. In some embodiments, n is 7. In some embodiments, n is 8. In some embodiments, n is 9. In some embodiments, n is 10.

The fluorescence may also be tuned by selection of values for m. In certain embodiments, m is an integer from 1 to 100. In other embodiments, m is an integer from 7 to 12. In some embodiments, m is an integer from 20 to 26. In some embodiments, m is an integer from 3 to 6. In some embodiments, m is 3. In some embodiments, m is 4. In some embodiments, m is 5. In some embodiments, m is 6. In some embodiments, m is 7. In some embodiments, m is 8. In some embodiments, m is 9. In some embodiments, m is 10. In some embodiments, m is 11.

M¹ and M² are selected based on the desired optical properties, for example based on a desired color and/or fluorescence emission wavelength. In some embodiments, M¹ and M² are the same at each occurrence; however, it is important to note that each occurrence of M¹ and M² need not be an identical M¹ and M², and certain embodiments include compounds wherein M¹ and M² are not the same at each occurrence. For example, in some embodiments each M¹ and M² are not the same and the different M¹ and M² moieties are selected to have absorbance and/or emissions for use in fluorescence resonance energy transfer (FRET) methods. For example, in such embodiments the different M¹ and M² moieties are selected such that absorbance of radiation at one wavelength causes emission of radiation at a different wavelength by a FRET mechanism. Exemplary M¹ and M² moieties can be appropriately selected by one of ordinary skill in the art based on the desired end use. Exemplary M¹ and M² moieties for FRET methods include fluorescein and 5-TAMRA (5-carboxytetramethylrhodamine, succinimidyl ester) dyes.

M¹ or M² may be attached to the remainder of the molecule from any position (i.e., atom) on M¹ or M², respectively. One of skill in the art will recognize means for attaching M¹ or M² to the remainder of molecule. Exemplary methods include the “click” reactions described herein.

In some embodiments, M¹ and M² are, at each occurrence, independently a fluorescent or colored moiety. Any fluorescent and/or colored moiety may be used, for examples those known in the art and typically employed in colorimetric, UV, and/or fluorescent assays may be used. Examples of M¹ and M² moieties which are useful in various embodiments of the disclosure include, but are not limited to: Xanthene derivatives (e.g., fluorescein, rhodamine, Oregon green, eosin or Texas red); Cyanine derivatives (e.g., cyanine, indocarbocyanine, oxacarbocyanine, thiacarbocyanine or merocyanine); Squaraine derivatives and ring-substituted squaraines, including Seta, SeTau, and Square dyes; Naphthalene derivatives (e.g., dansyl and prodan derivatives); Coumarin derivatives; oxadiazole derivatives (e.g., pyridyloxazole, nitrobenzoxadiazole or benzoxadiazole); Anthracene derivatives (e.g., anthraquinones, including DRAQ5, DRAQ7 and CyTRAK Orange); Pyrene derivatives such as cascade blue; Oxazine derivatives (e.g., Nile red, Nile blue, cresyl violet, oxazine 170); Acridine derivatives (e.g., proflavin, acridine orange, acridine yellow); Arylmethine derivatives: auramine, crystal violet, malachite green; and Tetrapyrrole derivatives (e.g., porphin, phthalocyanine or bilirubin). Other exemplary M¹ and M² moieties include: Cyanine dyes, xanthate dyes (e.g., Hex, Vic, Nedd, Joe or Tet); Yakima yellow; Redmond red; tamra; texas red and Alexa Fluor® dyes.

In still other embodiments of any of the foregoing, M¹ and M² each occurrence independently comprises three or more aryl or heteroaryl rings, or combinations thereof, for example four or more aryl or heteroaryl rings, or combinations thereof, or even five or more aryl or heteroaryl rings, or combinations thereof. In some embodiments, M¹ and M² each occurrence independently comprises six aryl or heteroaryl rings, or combinations thereof. In further embodiments, the rings are fused. For example in some embodiments, M¹ and M² each occurrence independently comprises three or more fused rings, four or more fused rings, five or more fused rings, or even six or more fused rings.

In some embodiments, M¹ and M² are, at each occurrence, independently cyclic. For example, in some embodiments M¹ and M² are, at each occurrence, independently carbocyclic. In other embodiment, M¹ and M² are, at each occurrence, independently heterocyclic. In still other embodiments of the foregoing, M¹ and M², at each occurrence, independently comprises an aryl moiety. In some of these embodiments, the aryl moiety is multicyclic. In other more specific examples, the aryl moiety is a fused-multicyclic aryl moiety, for example which may comprise at least 2, at least 3, at least 4, or even more than 4 aryl rings.

In other embodiments of any of the foregoing compounds of structure (I), (Ia), (Ib), (Ic), (Id), (Ie) or (I′), M¹ or M², at each occurrence, independently comprises at least one heteroatom. For example, in some embodiments, the heteroatom is nitrogen, oxygen or sulfur.

In still more embodiments of any of the foregoing, M¹ and M², at each occurrence, independently comprises at least one substituent. For example, in some embodiments the substituent is a fluoro, chloro, bromo, iodo, amino, alkylamino, arylamino, hydroxy, sulfhydryl, alkoxy, aryloxy, phenyl, aryl, methyl, ethyl, propyl, butyl, isopropyl, t-butyl, carboxy, sulfonate, amide, or formyl group.

In some even more specific embodiments of the foregoing, M¹ and M², at each occurrence, independently is a dimethylaminostilbene, quinacridone, fluorophenyl-dimethyl-BODIPY, his-fluorophenyl-BODIPY, acridine, terrylene, sexiphenyl, porphyrin, benzopyrene, (fluorophenyl-dimethyl-difluorobora-diaza-indacene)phenyl, (bis-fluorophenyl-difluorobora-diaza-indacene)phenyl, quaterphenyl, bi-benzothiazole, ter-benzothiazole, bi-naphthyl, bi-anthracyl, squaraine, squarylium, 9, 10-ethynylanthracene or ter-naphthyl moiety. In other embodiments, M¹ and M² are, at each occurrence, independently p-terphenyl, perylene, azobenzene, phenazine, phenanthroline, acridine, thioxanthrene, chrysene, rubrene, coronene, cyanine, perylene imide, or perylene amide or a derivative thereof. In still more embodiments, M¹ and M² are, at each occurrence, independently a coumarin dye, resorufin dye, dipyrrometheneboron difluoride dye, ruthenium bipyridyl dye, energy transfer dye, thiazole orange dye, polymethine or N-aryl-1,8-naphthalimide dye.

In still more embodiments of any of the foregoing, M¹ and M² at each occurrence are the same. In other embodiments, each M¹ and M² are different. In still more embodiments, one or more M¹ and M² are the same and one or more M¹ and M² are different.

In some embodiments, M¹ and M² are, at each occurrence independently pyrene, perylene, perylene monoimide or 6-FAM or a derivative thereof. In some other embodiments, M¹ and M², at each occurrence, independently has one of the following structures:

Although M¹ and M² moieties comprising carboxylic acid groups are depicted in the anionic form (CO₂ ⁻) above, one of skill in the art will understand that this will vary depending on pH, and the protonated form (i.e., —CO₂H) is included in various embodiments.

In some specific embodiments, the compound is a compound selected from Table 2. The compounds in Table 2 were prepared according to the procedures set forth in the Examples and their identity confirmed by mass spectrometry.

TABLE 2 Exemplary Compounds of Structure I MW. Found No. Calc. Structure I-1 15141.9 15137  

I-2 7924.2  7969.7† (avg)

I-3 14449.3 —

†Compound I-2 is drawn as a structure representing the average molecular weight (i.e., having 23 ethylene glycol units)

As used in Table 2 and throughout the application R¹, R², n, and L′ have the definitions provided for compounds of structure (I) unless otherwise indicated, and F, F′ and F″ refer to a fluorescein moiety having the following structures, respectively:

In some embodiments, M¹ or M² is, at each occurrence, independently F, F′ or F″.

It is well known in the art that fluorescein moieties tautomerize between quinoid, zwitterionic, and lactoid forms. One of skill in the art will readily understand that the form is dependent on pH and each form (e.g., quinoid, zwitterionic, and lactoid) are also included in the scope of embodiments of the disclosure.

As used in Tables 2 above and throughout this disclosure dT refers to the following structure:

wherein:

R is H or a direct bond.

As used throughout this disclosure, B and B′ refer to the following structures, respectively:

In some embodiments, M¹ or M² is, at each occurrence, independently B or B′.

As used throughout this disclosure, T refers to the following structure:

In specific embodiments, M¹ or M² is, at each occurrence, independently T.

As used throughout this disclosure, C refers to the following structure:

In some embodiments, M¹ or M² is, at each occurrence, independently C.

As used throughout this disclosure, Y refers to the following structure:

In some embodiments, M¹ or M² is, at each occurrence, independently Y.

Some embodiments include any of the foregoing compounds, including the specific compounds provided in Table 2, conjugated to a targeting moiety, such as an antibody.

The present disclosure generally provides compounds having increased fluorescence emission relative to earlier known compounds. Accordingly, certain embodiments are directed to a fluorescent compound comprising n fluorescent moieties M¹ and/or M², wherein the fluorescent compound has a peak fluorescence emission upon excitation with a predetermined wavelength of ultraviolet light of at least 85% of n times greater than the peak fluorescence emission of a single M¹ or M² moiety upon excitation with the same wavelength of ultraviolet light, and wherein n is an integer of 2 or more. Fluorescent compounds include compounds which emit a fluorescent signal upon excitation with light, such as ultraviolet light.

In some embodiments, the fluorescent compound has a peak fluorescence emission of at least 90% of n times greater, 95% of n times greater, 97% of n times greater or 99% of n times greater than the peak fluorescence emission of a single M¹ and/or M² moiety.

In some embodiments, n is an integer from 2 to 100, for example 2-10.

In some embodiments, the n M¹ and/or M¹ moieties have, independently, one of the following structures:

wherein

indicates a point of attachment to the fluorescent compound.

In other embodiments, the single M¹ or M² moiety has, independently, one of the following structures:

In more specific embodiments, the fluorescent compound comprises n M¹ and/or M² moieties, independently having one of the following structures:

wherein

indicates a point of attachment to the fluorescent compound, and the single M¹ or M² moiety has the following structure.

In other embodiments, the peak fluorescence emission is at a wavelength ranging from about 500 to about 550 nm.

In still more embodiments, the fluorescent compound comprises at least one ethylene oxide moiety.

Compositions comprising the fluorescent compound of any one of claims and an analyte are also provided.

The presently disclosed compounds are “tunable,” meaning that by proper selection of the variables in any of the foregoing compounds, one of skill in the art can arrive at a compound having a desired and/or predetermined molar fluorescence (molar brightness). The tunability of the compounds allows the user to easily arrive at compounds having the desired fluorescence and/or color for use in a particular assay or for identifying a specific analyte of interest. Although all variables may have an effect on the molar fluorescence of the compounds, proper selection of M¹, M², L^(1a), L^(1b), L³, L⁴, q, w, m and n is believed to play an important role in the molar fluorescence of the compounds. Accordingly, in one embodiment is provided a method for obtaining a compound having a desired molar fluorescence, the method comprising selecting M¹ or M² moieties having a known fluorescence, preparing a compound of structure (I) comprising the M¹ or M² moieties, and selecting the appropriate variables for M¹, M², L^(1a), L^(1b), L³, L⁴, q, w, m and n to arrive at the desired molar fluorescence.

Molar fluorescence in certain embodiments can be expressed in terms of the fold increase or decrease relative to the fluorescence emission of the parent fluorophore (e.g., monomer). In some embodiments the molar fluorescence of the present compounds is 1.1×, 1.5×, 2×, 3×, 4×, 5×, 6×, 7×, 8×, 9×, 10× or even higher relative to the parent fluorophore. Various embodiments include preparing compounds having the desired fold increase in fluorescence relative to the parent fluorophore by proper selection of M¹, M², L^(1a), L^(1b), L³, L⁴, q, w, m and n.

For ease of illustration, various compounds comprising phosphorous moieties (e.g., phosphate and the like) are depicted in the anionic state (e.g., —OPO(OH)O⁻, —OPO₃ ²⁻). One of skill in the art will readily understand that the charge is dependent on pH and the uncharged (e.g., protonated or salt, such as sodium or other cation) forms are also included in the scope of embodiments of the disclosure.

Compositions comprising any of the foregoing compounds and one or more analyte molecules (e.g., biomolecules) are provided in various other embodiments. In some embodiments, use of such compositions in analytical methods for detection of the one or more analyte molecules is also provided.

In still other embodiments, the compounds are useful in various analytical methods. For example, in certain embodiments the disclosure provides a method of staining a sample, the method comprising adding to said sample a compound of structure (I), for example wherein one of R¹ or R² is a linker comprising a covalent bond to an analyte molecule (e.g., biomolecule) or microparticle, and the other of R¹ or R² is H, OH, alkyl, alkoxy, alkylether or —OP(═R_(a))(R_(b))R_(c), in an amount sufficient to produce an optical response when said sample is illuminated at an appropriate wavelength.

In some embodiments of the foregoing methods, R¹ is a linker comprising a covalent linkage to an analyte molecule, such as a biomolecule. For example, in some embodiments the biomolecule is a nucleic acid, amino acid or a polymer thereof (e.g., polynucleotide or polypeptide). In still more embodiments, the biomolecule is an enzyme, receptor, receptor ligand, antibody, glycoprotein, aptamer or prion.

In yet other embodiments of the foregoing method, R¹ is a linker comprising a covalent linkage to a solid support such as a microparticle. For example, in some embodiments the microparticle is a polymeric bead or non-polymeric bead.

In even more embodiments, said optical response is a fluorescent response.

In other embodiments, said sample comprises cells, and some embodiments further comprise observing said cells by flow cytometry.

In still more embodiments, the method further comprises distinguishing the fluorescence response from that of a second fluorophore having detectably different optical properties.

In other embodiments, the disclosure provides a method for visually detecting an analyte molecule, such as a biomolecule, comprising:

(a) providing a compound of structure (I), for example, wherein one of R¹ or R² is a linker comprising a covalent bond to the analyte molecule, and the other of R¹ or R² is H, OH, alkyl, alkoxy, alkylether or —OP(═R_(a))(R_(b))R_(c); and

(b) detecting the compound by its visible properties.

In some embodiments the analyte molecule is a nucleic acid, amino acid or a polymer thereof (e.g., polynucleotide or polypeptide). In still more embodiments, the analyte molecule is an enzyme, receptor, receptor ligand, antibody, glycoprotein, aptamer or prion.

In other embodiments, a method for visually detecting an analyte molecule, such as a biomolecule is provided, the method comprising:

(a) admixing any of the foregoing compounds with one or more analyte molecules; and

(b) detecting the compound by its visible properties.

In other embodiments is provided a method for visually detecting an analyte molecule, the method comprising:

(a) admixing the compound of structure (I), wherein R¹ or R² is Q or a linker comprising a covalent bond to Q, with the analyte molecule;

(b) forming a conjugate of the compound and the analyte molecule; and

(c) detecting the conjugate by its visible properties.

Other exemplary methods include a method for detecting an analyte, the method comprising:

(a) providing a compound of structure (I), wherein R¹ or R² comprises a linker comprising a covalent bond to a targeting moiety having specificity for the analyte;

(b) admixing the compound and the analyte, thereby associating the targeting moiety and the analyte; and

(c) detecting the compound, for example by its visible or fluorescent properties.

In certain embodiments of the foregoing method, the analyte is a particle, such as a cell, and the method includes use of flow cytometry. For example, the compound may be provided with a targeting moiety, such as an antibody, for selectively associating with the desired cell, thus rendering the cell detectable by any number of techniques, such as visible or fluorescence detection. Appropriate antibodies can be selected by one of ordinary skill in the art depending on the desired end use. Exemplary antibodies for use in certain embodiments include UCHT1 and MOPC-21.

Embodiments of the present compounds thus find utility in any number of methods, including, but not limited: cell counting; cell sorting; biomarker detection; quantifying apoptosis; determining cell viability; identifying cell surface antigens; determining total DNA and/or RNA content; identifying specific nucleic acid sequences (e.g., as a nucleic acid probe); and diagnosing diseases, such as blood cancers.

In addition to the above methods, embodiments of the compounds of structure (I) find utility in various disciplines and methods, including but not limited to: imaging in endoscopy procedures for identification of cancerous and other tissues; single-cell and/or single molecule analytical methods, for example detection of polynucleotides with little or no amplification; cancer imaging, for example by including a targeting moiety, such as an antibody or sugar or other moiety that preferentially binds cancer cells, in a compound of structure (I) to; imaging in surgical procedures; binding of histones for identification of various diseases; drug delivery, for example by replacing the M¹ or M² moieties in a compound of structure (I) with an active drug moiety; and/or contrast agents in dental work and other procedures, for example by preferential binding of the compound of structure (I) to various flora and/or organisms.

It is understood that any embodiment of the compounds of structure (I), as set forth above, and any specific choice set forth herein for a R¹, R², R³, R⁴, R⁵, L¹, L², L³, L⁴, L⁵, L⁶, L⁷, M¹, M², q, w, m and/or n variable in the compounds of structure (I), as set forth above, may be independently combined with other embodiments and/or variables of the compounds of structure (I) to form embodiments of the disclosure not specifically set forth above. In addition, in the event that a list of choices is listed for any particular R¹, R², R³, R⁴, R₅, L¹, L², L³, L⁴, L⁵, L⁶, L⁷, M¹, M², q, w, m and/or n variable in a particular embodiment and/or claim, it is understood that each individual choice may be deleted from the particular embodiment and/or claim and that the remaining list of choices will be considered to be within the scope of the disclosure.

It is understood that in the present description, combinations of substituents and/or variables of the depicted formulae are permissible only if such contributions result in stable compounds.

It will also be appreciated by those skilled in the art that in the process described herein the functional groups of intermediate compounds may need to be protected by suitable protecting groups. Such functional groups include hydroxy, amino, mercapto and carboxylic acid. Suitable protecting groups for hydroxy include trialkylsilyl or diarylalkylsilyl (for example, t-butyldimethylsilyl, t-butyldiphenylsilyl or trimethylsilyl), tetrahydropyranyl, benzyl, and the like. Suitable protecting groups for amino, amidino and guanidino include t-butoxycarbonyl, benzyloxycarbonyl, and the like. Suitable protecting groups for mercapto include —C(O)—R″ (where R″ is alkyl, aryl or arylalkyl), p-methoxybenzyl, trityl and the like. Suitable protecting groups for carboxylic acid include alkyl, aryl or arylalkyl esters. Protecting groups may be added or removed in accordance with standard techniques, which are known to one skilled in the art and as described herein. The use of protecting groups is described in detail in Green, T. W. and P. G. M. Wutz, Protective Groups in Organic Synthesis (1999), 3^(rd) Ed., Wiley. As one of skill in the art would appreciate, the protecting group may also be a polymer resin such as a Wang resin, Rink resin or a 2-chlorotrityl-chloride resin.

Furthermore, all compounds of the disclosure which exist in free base or acid form can be converted to their salts by treatment with the appropriate inorganic or organic base or acid by methods known to one skilled in the art. Salts of the compounds of the disclosure can be converted to their free base or acid form by standard techniques.

The following Reaction Schemes illustrate exemplary methods of making compounds of this disclosure. It is understood that one skilled in the art may be able to make these compounds by similar methods or by combining other methods known to one skilled in the art. It is also understood that one skilled in the art would be able to make, in a similar manner as described below, other compounds of structure (I) not specifically illustrated below by using the appropriate starting components and modifying the parameters of the synthesis as needed. In general, starting components may be obtained from sources such as Sigma Aldrich, Lancaster Synthesis, Inc., Maybridge, Matrix Scientific, TCI, and Fluorochem USA, etc. or synthesized according to sources known to those skilled in the art (see, for example, Advanced Organic Chemistry: Reactions, Mechanisms, and Structure, 5^(th) edition (Wiley, December 2000)) or prepared as described in this disclosure.

Reaction Scheme I illustrates a method for preparation of intermediates useful for preparation of compounds of structure (I). Referring to reaction Scheme I, wherein L¹, L^(1b), L^(1b)′, L², L³, G¹ and M¹ are as defined above, and R¹ and R² are as defined above, or are protected variants thereof, a compound of structure a, which can be purchased or prepared by well-known techniques, is reacted with M-G¹′ to yield compounds of structure b. Here, G¹ and G¹′ represent functional groups having complementary reactivity (i.e., functional groups which react to form a covalent bond). G¹′ may be pendant to M¹ or a part of the structural backbone of M¹. G¹ and G¹′ may be any number of functional groups described herein, such as alkyne and azide, respectively, amine and activated ester, respectively or amine and isothiocyanate, respectively, and the like. M² can be attached to form a compound of structure (I) in an analogous manner by selecting appropriate reagents according to Reaction Scheme I above.

Additionally, compounds of the present disclosure can be prepared according to the methods described in PCT Pub. Nos. WO 2016/183185; WO 2017/173355; and WO 2017/177065, each of which are hereby incorporated by reference.

The compound of structure (I) may be prepared from structure b by reaction under well-known automated DNA synthesis conditions with a phosphoramidite compound having the following structure (c):

wherein L is an optional linker (e.g., L⁴). In some embodiments of (c), L has one of the following structures:

wherein:

L^(1a), L^(1b), L^(1b) L², L³, L⁵, L⁶, L⁷, L⁷′, G¹, G², M² and M¹ are as defined herein. DNA synthesis methods are well-known in the art. Briefly, two alcohol groups, for example R¹ and R² in intermediate b above, are functionalized with a dimethoxytrityl (DMT) group and a 2-cyanoethyl-N,N-diisopropylamino phosphoramidite group, respectively. The phosphoramidite group is coupled to an alcohol group, typically in the presence of an activator such as tetrazole, followed by oxidation of the phosphorous atom with iodine. The dimethoxytrityl group can be removed with acid (e.g., chloroacetic acid) to expose the free alcohol, which can be reacted with a phosphoramidite group. The 2-cyanoethyl group can be removed after oligomerization by treatment with aqueous ammonia.

Preparation of the phosphoramidites used in the oligomerization methods is also well-known in the art. For example, a primary alcohol (e.g., R¹) can be protected as a DMT group by reaction with DMT-Cl. A secondary alcohol (e.g., R²) is then functionalized as a phosphoramidite by reaction with an appropriate reagent such as 2-cyanoethyl N,N-dissopropylchlorophosphoramidite. Methods for preparation of phosphoramidites and their oligomerization are well-known in the art and described in more detail in the examples.

Compounds of structure (I) are prepared by oligomerization of intermediates b and c according to the well-known phophoramidite chemistry described above. The desired number of n repeating units is incorporated into the molecule by repeating the phosphoramidite coupling the desired number of times. It will be appreciated that compounds of structure (II) as, described below, can be prepared by analogous methods.

In various other embodiments, compounds useful for preparation of the compound of structure (I) are provided. The compounds can be prepared as described above in monomer, dimer and/or oligomeric form and then the M¹ and/or M² moiety covalently attached to the compound via any number of synthetic methodologies (e.g., the “click” reactions described above) to form a compound of structure (I). Accordingly, in various embodiments a compound is provided having the following structure (II):

or a stereoisomer, salt or tautomer thereof, wherein:

G¹ and G² are, at each occurrence, independently a moiety comprising a reactive group, or protected analogue thereof, capable of forming a covalent bond with a complementary reactive group;

L^(1a) is at each occurrence, independently a heteroarylene linker;

L^(1b′), L², L³, L⁵, L⁶, and L^(7′) are, at each occurrence, independently optional alkylene, alkenylene, alkynylene, heteroalkylene, heteroalkenylene or heteroalkynylene linkers;

L⁴ is, at each occurrence, independently an alkylene, alkenylene, alkynylene, heteroalkylene, heteroalkenylene or heteroalkynylene linker;

R¹ and R² are each independently H, OH, SH, alkyl, alkoxy, alkylether, heteroalkyl, —OP(═R_(a))(R_(b))R_(c), Q, or a protected form thereof, or L′;

R³ is, at each occurrence, independently H, alkyl or alkoxy;

R⁴ is, at each occurrence, independently OH, SH, O⁻, S⁻, OR_(d) or SR_(d);

R⁵ is, at each occurrence, independently oxo, thioxo or absent;

R_(a) is O or S;

R_(b) is OH, SH, O⁻, S⁻, OR_(d) or SR_(a);

R_(c) is OH, SH, O⁻, S⁻, OR_(d), OL′, SR_(d), alkyl, alkoxy, heteroalkyl, heteroalkoxy, alkylether, alkoxyalkylether, phosphate, thiophosphate, phosphoalkyl, thiophosphoalkyl, phosphoalkylether or thiophosphoalkylether;

R_(d) is a counter ion;

Q is, at each occurrence, independently a moiety comprising a reactive group, or protected form thereof, capable of forming a covalent bond with an analyte molecule, a targeting moiety, a solid support or a complementary reactive group Q′;

L′ is, at each occurrence, independently a linker comprising a covalent bond to Q, a linker comprising a covalent bond to a targeting moiety, a linker comprising a covalent bond to an analyte molecule, a linker comprising a covalent bond to a solid support, a linker comprising a covalent bond to a solid support residue, a linker comprising a covalent bond to a nucleoside or a linker comprising a covalent bond to a further compound of structure (I);

m is, at each occurrence, an integet of one or greater;

n is an integer of one or greater; and

q and w are, at each occurrence, independently 0 or 1, provided at least one occurrence of w is 1. In some embodiments, q is 0.

In some embodiments of compound (II), L^(1a) has one of the following structures:

In some more specific embodiments, the compound has the following structure (IIa):

In some related embodiments, the compound has one of the following structures (IIb) or (IIc):

wherein:

L^(1b′) is, at each occurrence, independently an optionally substituted alkylene or an optionally substituted heteroalkylene linker. In some embodiments, L^(1b′) is an optionally substituted heteroalkenylene linker.

In certain embodiments, the compound has one of the following structures (IId) or (IIe):

wherein:

z is an integer from 1 to 100.

In certain embodiments, L^(1b′) has one of the following structures:

In some specific embodiments, L^(1b′) is an alkylene, for example, ethylene, propylene, butylene or pentylene.

In certain related embodiments, -L^(1b′)-G has one of the following structures:

In other embodiments of structure (II), G¹ and G² are, at each occurrence, independently a moiety comprising a reactive group capable of forming a covalent bond with a complementary reactive group.

The G¹ and G² moieties in the compound of structure (II) can be selected from any moiety comprising a group having the appropriate reactivity group for forming a covalent bond with a complementary group on an M¹ and/or M² moiety. In exemplary embodiments, the G¹ and G² moieties can be selected from any of the Q moieties described herein, including those specific examples provided in Table 1. In some embodiments, G¹ and G² at each occurrence, independently comprises a moiety suitable for reactions including: the copper catalyzed reaction of an azide and alkyne to form a triazole (Huisgen 1, 3-dipolar cycloaddition), reaction of a diene and dienophile (Diels-Alder), strain-promoted alkyne-nitrone cycloaddition, reaction of a strained alkene with an azide, tetrazine or tetrazole, alkene and azide [3+2] cycloaddition, alkene and tetrazine inverse-demand Diels-Alder, alkene and tetrazole photoreaction and various displacement reactions, such as displacement of a leaving group by nucleophilic attack on an electrophilic atom.

In some embodiments, G¹ and G² are, at each occurrence, independently a moiety comprising an aldehyde, oxime, hydrazone, alkyne, amine, azide, acylazide, acylhalide, nitrile, nitrone, sulfhydryl, disulfide, sulfonyl halide, isothiocyanate, imidoester, activated ester, ketone, β,β-unsaturated carbonyl, alkene, maleimide, α-haloimide, epoxide, aziridine, tetrazine, tetrazole, phosphine, biotin or thiirane functional group. In certain embodiments, at least one occurrence of G¹ or G² has a structure selected from Table 1. In some more specific embodiments, G¹ or G², at each occurrence, independently have a structure selected from Table 1.

In other embodiments, G¹ and G² at each occurrence, independently comprises an alkyne or an azide group. In other embodiments, G¹ and G² at each occurrence, independently comprises an amino, isothiocyanate or activated ester group. In different embodiments, G¹ and G² at each occurrence, independently comprises a reactive group capable of forming a functional group comprising an alkene, ester, amide, thioester, disulfide, carbocyclic, heterocyclic or heteroaryl group, upon reaction with the complementary reactive group. For example, in some embodiment the heteroaryl is triazolyl.

In other of any of the foregoing embodiments of compound (II), G¹ and G² are, at each occurrence, independently

In some embodiments, at least one occurrence of G¹ or G² has one of the following structures:

In some related embodiments, G¹ or G², at each occurrence, independently have one of the following structures:

In some embodiments of compound (II), at least one occurrence of G¹ is

In more specific embodiments, G¹ is, at each occurrence, independently

In some embodiments of compound (II), at least one occurrence of G¹ or G² is —NH₂. In some embodiments, G¹ and G² are, at a plurality of occurrences, independently —NH₂. In certain embodiments, G¹ and G² are, at each occurrence, independently —NH₂.

In some embodiments of compound (II), at least one occurrence of G¹ and G² is a protected form of an amine. In some embodiments, G¹ and G² are, at a plurality of occurrences, independently a protected form of an amine. In certain embodiments, G¹ and G² are, at each occurrence, independently a protected form of an amine.

In some of the foregoing embodiments, the protected form of the amine is a trifluoroacetate protected amine. In some embodiments, the protected form of the amine is a BOC protected amine. In some embodiments, the protected form of the amine is an Fmoc protected amine. For example, in certain embodiments, at least one occurrence of G¹ or G² has one of the following structures:

In more specific embodiments, G¹ or G², at each occurrence, independently has one of the following structures:

In some embodiments, R¹, R², R³, R⁴, R⁵, L², L³, L⁴, L⁵, or L⁶ are as defined in any one of the foregoing embodiments. For example, in some embodiments of compound (II), R⁴ is at each occurrence oxo. In some embodiments of compound (II), R⁵ is at each occurrence, independently OH, O⁻ or OR_(d). In certain embodiments of compound (II), L² is absent at each occurrence. In some specific embodiments of compound (II), L³ is an alkylene linker (e.g., methylene) at each occurrence.

In some embodiments, the compound of structure (II) is a compound of Table 3.

TABLE 3 Exemplary Compounds of Structure II No. Structure II-1

II-2

II-3

†Compound II-2 is drawn as a structure representing the average ethylene glycol units

As described in detail above, compounds of structures (I) and compounds of structure (II) can be prepared by oligomerization using well known phosphoramidite chemistry. Applicants have discovered intermediate compounds useful for synthesis of compounds of structures (I) and compounds of structure (II). Accordingly, one embodiment provides a compound having the following structure (III):

wherein:

n₁ is an integer from 1 to 6;

n₂ is an integer from 1 to 3;

X is O or a direct bond;

R¹″ and R²″ are, at each occurrence, independently H, a protecting group, or an activated phosphorus moiety;

R³″ is H, or alkyl;

R⁴″ is alkoxy, haloalkyl, alkyl, an optionally substituted aryl or an optionally substituted aralkyl.

In some embodiments of compound (III), n₁ is 2. In some embodiments, n is 4. In some related embodiments, n₂ is 1. In certain embodiments, n₁ is 2 and n₂ is 1. In other embodiments, n₁ is 4 and n₂ is 1. In some of the foregoing embodiments, X is a direct bond.

In some embodiments of compound (III), n₁ is 2. In certain related embodiments, n₂ is 2. In some of the foregoing embodiments, X is O.

In some embodiments of compound (III), X is a direct bond. In some embodiments, X is O.

In some embodiments of compound (III), R¹″ is H. In certain embodiments, is a protecting group, for example, a trityl protecting group. In some embodiments, R¹″ is trityl. In some embodiments, R¹″ is 4-methoxytrityl. In more specific embodiments, R¹″ is 4,4′-dimethoxytrityl.

In some embodiments, R²″ is H. In some embodiments, R²″ is an activated phosphorus moiety. For example, in some embodiments R²″ comprises the following structure:

wherein:

R⁵″ is H or cyano alkyl; and

R⁶″ is, at each occurrence, independently C₁-C₆ alkyl.

In some embodiments of compound (III), R⁵″ is H. In other embodiments, R⁵″ is 2-cyanoethyl.

In some embodiments, at least one occurrence of R⁶″ is isopropyl. In some embodiments, each occurrence of R⁶″ is isopropyl.

In certain specific embodiments, R²″ has the following structure:

In some embodiments of compound (III), R³″ is H.

In some embodiments of compound (III), R⁴″ is an aryl comprising 1, 2, or 3 aromatic rings, e.g., R⁴″ comprises 1 or 2 aromatic rings. In some embodiments, R⁴″ does not comprise silicon. In some embodiments, R⁴″ is C₁-C₄ haloalkyl. In more specific embodiments, R⁴″ is —CF₃. In some embodiments, R⁴″ is C₁-C₄ alkoxy. In more specific embodiments, R⁴″ is tert-butoxy.

In some specific embodiments, compound (III) is selected from Table 4.

TABLE 4 Exemplary Compounds of Structure III No. Structure III-1

III-2

III-4

The following examples are provided for purposes of illustration, not limitation.

EXAMPLES General Methods

Mass spectral analysis was performed on a Waters/Micromass Quattro micro MS/MS system (in MS only mode) using MassLynx 4.1 acquisition software. Mobile phase used for LC/MS on dyes was 100 mM 1,1,1,3,3,3-hexafluoro-2-propanol (VIP), 8.6 mM triethylamine (TEA), pH 8. Phosphoramidites and precursor molecules were also analyzed using a Waters Acquity UHPLC system with a 2.1 mm×50 mm Acquity BEH-C₁₈ column held at 45° C., employing an acetonitrile/water mobile phase gradient. Molecular weights for monomer intermediates were obtained using tropylium cation infusion enhanced ionization on a Waters/Micromass Quattro micro MS/MS system (in MS only mode). Excitation and emission profiles experiments were recorded on a Cary Eclipse spectra photometer.

All reactions were carried out in oven dried glassware under a nitrogen atmosphere unless otherwise stated. Commercially available DNA synthesis reagents were purchased from Glen Research (Sterling, Va.). Anhydrous pyridine, toluene, dichloromethane, diisopropylethyl amine, triethylamine, acetic acid, pyridine, and THE were purchased from Aldrich. All other chemicals were purchase from Aldrich or TCI and were used as is with no additional purification.

Example 1 Synthesis of Compound I-1 Stock Solution Preparation

Borate buffer prepared at 250 mM, pH 10 Fluorscein-NHS solution prepared at 350 mM (300 mg in 1.35 mL DMSO:acetonitrile at 25:75)

Solid Phase Synthesis

Compound I-1 was prepared on the DNA synthesizer via solid support using standard DNA synthesis techniques (i.e., DMT protected 2-cyanoethyl phosphoramidite). The polymer was removed from the solid support with ammonium hydroxide and lyophilized to a paste. 250 mg aliquots were reconstituted in water. A small aliquot was removed and serial dilutions were prepared in 100 mM NaCO₃ at pH 9 to determine concentration (A 263 ε=10,000). Final stock concentration was found to be 14.5 mM.

Dye Coupling Reaction

In 50 mL centrifuge tube equipped with magnetic stir bar was placed water (1.110 μL), borate buffer (1.800 μL), Compound I-1 polymer solution (466 μL), acetonitrile (137.5 μL), triethylamine (313 μL) and fluorescein-NHS solution (675 μL). The tube was wrapped in aluminum foil and the mixture stirred overnight at room temperature.

Size Exclusion Filtration

To an Amicon Ultra-15 Centrifugal filter (Millipore UFC900324, MW cutoff=3000) was added 1 mL of water. The crude reaction from the dye coupling reaction (4.5 mL) was added to the filtration setup. The reaction vessel was rinsed 2× with 4 mL of 100 mM NaOH and the rinseates were transferred to the filtration setup. The filtration setup was centrifuged at max speed (3220 g, swing bucket, 30 minutes). The filtrate was removed and the retentate treated with an additional 10 mL of 100 mM NaOH. The filtration setup was centrifuged as before. Again, the filtrate was removed and a third 10 mL 100 mM NaOH aliquot was added to the retentate. The setup was centrifuged as before and the filtrate removed. A fourth 10 mL 100 mM NaOH aliquot was added to the retentate and centrifuged as before. The filtrate was removed and 10 mL of water were added to the filtration setup. The mixture was centrifuged as before. The retentate was removed, the filtration vessel washed with water and the rinesates added to the final volume (3.5 mL). The desired product was confirmed by LC-MS and absorbance was used to determine concentration. A sample of Compound I-1 was also analyzed by PAGE (FIG. 1). FIG. 1 shows Compound A (MW=14104), Compound B (MW=15686) and Compound C (MW=16231) compared to Compound I-2.

Example 2 Synthesis of Compound I-2 Stock Solutions

Borate buffer was prepared as described in Example 1 4.6 M Magnesium chloride 300 mM Fluorscein-NHS solution was prepared in DMSO

Solid Phase Synthesis

Compound I-2 was prepared on the DNA synthesizer via solid support using standard DNA synthesis techniques (i.e., DMT protected 2-cyanoethyl phosphoramidite). The polymer was removed from the solid support with concentrated ammonium hydroxide and lyophilized to a paste. 20 mg was reconstituted from water and a small aliquot was removed, serial dilutions were prepared in 100 mM NaCO₃ at pH 9 to determine concentration (A 263, ε=10,000). Final stock concentration was found to be 11.6 mM.

Dye Coupling Reaction

In a 200 μL micro-centrifuge tube was placed water (10 μL), borate buffer (20 μL), Compound I-2 polymer solution (2.2 μL), magnesium chloride solution (5.4 μL), DMSO (6.9 μL), fluorescein-NHS solution (5.6 μL). The tube was vortexed and allowed to stand overnight. The mixture was diluted with 50 mL of water and desalted with polyacrylamide desalting columns (Pierce, catalogue #89849). The desired product was confirmed by LC-MS and analyzed by PAGE (FIG. 2). FIG. 2 shows Compound A (MW=14104), Compound B (MW=15686) and Compound C (MW=16231) compared to Compound I-2.

Example 3 Flow Cytometry Method and Applications

A general flow cytometry workflow includes the following steps:

1. Culture and visually observe cells for signs of metabolic stress and/or use fresh, induced, or simulated cells.

2. Dilute dye compounds to working volumes.

3. Harvest and prepare cells without killing or inducing apoptosis.

4. Centrifuge and wash cells with appropriate buffer.

5. Perform cell counts using hemocytometer and trypan blue exclusion.

6. Centrifuge and wash cells

7. Adjust cell density to test size

8. Apply dye (pre-dilution) or other co-stains of interest.

9. Incubate the cell/stain/dye mixture.

10. Centrifuge and wash cells with appropriate buffer.

11. Re-suspend cells in acquisition buffer.

12. Acquire cell data by flow cytometry.

The general workflow described above can be modified according to specific applications. Some modifications for specific applications are described below.

Live/Dead Discrimination

Cells are tested for viability by positively staining necrotic cells to compare damaged cells to intact cells. Assays are used to target non-intact (fixed and non-fixed) cells with positively charged moieties, cell debris, apoptotic bodies, depolarized cell membrane, and permeabilized membranes. Cells are then stained with dye (e.g., Compound I-1) using routine cell preparations (fresh or fixed) and analyzed using flow cytometry.

Cell Health

A comparison is made between dead cells (i.e., necrotic cells), early apoptotic, late apoptotic, and live cells. Dead cells are positively stained, Apoprotic bodies are intermediately stained, and live cells are left negative. This strategy results in very bright necrotic cells and works also to assess cell permeability. Assays are used to target non-intact (fixed and non-fixed) cells with positively charged moieties, cell debris, apoptotic bodies, depolarized cell membrane, and permeabilized membranes. Dye staining is performed on in vitro cultures, primary cells, and samples treated with xenobiotics and analyzed using flow cytometry.

Cell Cycle

Cell ploidy and mitosis in the cell cycle is tracked by staining correlated to positively staining DNA intercalators in all cells and cellular bodies containing nucleic acid and cell cycle associated proteins. Assays are used to target non-intact (non-fixed only) cells with positively charged moieties, cell debris, apoptotic bodies, depolarized cell membrane, and permeabilized membranes. Assays are used to target intact (fixed and permeabilized) cells by staining positively charged moieties after preservation of cells are fixed and permeabilized for intracellular staining. Dye staining (in combination with other dyes) is performed on in vitro cultures, primary cells, and samples treated with xenobiotics and analyzed using flow cytometry.

Proliferation

Cell proliferation is monitored by staining correlated to positively staining DNA intercalators in all cells and cellular bodies containing nucleic acid and cell cycle associated proteins. Assays are used to target non-intact (non-fixed only) cells with positively charged moieties, cell debris, apoptotic bodies, depolarized cell membrane, and permeabilized membranes. Assays are used to target intact (fixed and permeabilized) cells by staining positively charged moieties after preservation of cells are fixed and permeabilized for intracellular staining. Dye staining (in combination with monitoring markers for cell proliferation, e.g. Ki67, BRDU) is performed on in vitro cultures, primary cells, and samples treated with xenobiotics and analyzed using flow cytometry.

Example 4 Activation and Antibody Conjugation of Compound I-1

The maleimide functionalized Compound I-1 is prepared according to the method described in Example 1. In parallel, an UCHT-1 antibody is treated with bis-maleimidoethane (“BMOE”) to reduce disulfide bonds. The reduced antibody is reacted with Compound I-1 in a 5:1 molar ratio of polymer to antibody. The reaction results in a final product having a polymer to antibody ratio of 1:1 as detected by size exclusion chromatography.

All of the U.S. patents, U.S. patent application publications, U.S. patent applications, foreign patents, foreign patent applications and non-patent publications referred to in this specification, including U.S. Provisional Patent Application No. 62/690,656, filed Jun. 27, 2018, are incorporated herein by reference, in their entirety to the extent not inconsistent with the present description.

From the foregoing it will be appreciated that, although specific embodiments of the disclosure have been described herein for purposes of illustration, various modifications may be made without deviating from the spirit and scope of the disclosure. Accordingly, the disclosure is not limited except as by the appended claims. 

1. A compound having the following structure (I):

or a stereoisomer, salt or tautomer thereof, wherein: M¹ is, at each occurrence, independently a moiety comprising a chromophore; L^(1a) is, at each occurrence, independently a heteroarylene linker; L^(1b), L² and L³ are, at each occurrence, independently optional alkylene, alkenylene, alkynylene, heteroalkylene, heteroalkenylene or heteroalkynylene linkers; L⁴ is, at each occurrence, independently an alkylene, alkenylene, alkynylene, heteroalkylene, heteroalkenylene or heteroalkynylene linker; R¹ and R² are each independently H, OH, SH, alkyl, alkoxy, alkylether, heteroalkyl, —OP(═R_(a))(R_(b))R_(c), Q, or a protected form thereof, or L′; R⁴ is, at each occurrence, independently OH, SH, O⁻, S⁻, OR_(d) or SR_(d); R⁵ is, at each occurrence, independently oxo, thioxo or absent; R_(a) is O or S; R_(b) is OH, SH, O⁻, S⁻, OR_(d) or SR_(d); R_(c) is OH, SH, O⁻, S⁻, OR_(d), OL′, SR_(d), alkyl, alkoxy, heteroalkyl, heteroalkoxy, alkylether, alkoxyalkylether, phosphate, thiophosphate, phosphoalkyl, thiophosphoalkyl, phosphoalkylether or thiophosphoalkylether; R_(d) is a counter ion; Q is, at each occurrence, independently a moiety comprising a reactive group, or protected form thereof, capable of forming a covalent bond with an analyte molecule, a targeting moiety, a solid support or a complementary reactive group Q′; L′ is, at each occurrence, independently a linker comprising a covalent bond to Q, a linker comprising a covalent bond to a targeting moiety, a linker comprising a covalent bond to an analyte molecule, a linker comprising a covalent bond to a solid support, a linker comprising a covalent bond to a solid support residue, a linker comprising a covalent bond to a nucleoside or a linker comprising a covalent bond to a further compound of structure (I); m is, at each occurrence, an integer of one or greater; and n is an integer of one or greater. 2-7. (canceled)
 8. The compound of claim 1, wherein L^(1a) has one of the following structures:


9. (canceled)
 10. The compound of claim 1, wherein at least one occurrence of L³ is an alkylene linker, and at least one occurrence of L⁴ comprises alkylene oxide. 11-19. (canceled)
 20. The compound of claim 1, wherein the compound has one of the following structures (Ib) or (Ic):

wherein: L^(1b) is, at each occurrence, independently an optionally substituted alkylene or an optionally substituted heteroalkylene linker.
 21. The compound of claim 1, wherein L⁴ is polyethylene oxide, and the compound has one of the following structures (Id) or (Ie):

wherein: z is an integer from 1 to
 100. 22. The compound of claim 1, wherein L^(1b), at each occurrence, independently comprises an amide functional group or a triazolyl functional group.
 23. The compound of claim 1, wherein R⁴ is, at each occurrence, oxo, and R⁵ is, at each occurrence, independently OH, O⁻ or OR_(d). 24-26. (canceled)
 27. The compound of claim 1, wherein R¹ and R² are each independently —OP(═R_(a))(R_(b))R_(c).
 28. The compound of claim 27, wherein R_(c) is OL′, wherein L′ is heteroalkylene linker to: Q, a targeting moiety, an analyte molecule, a solid support, a solid support residues, a nucleoside or a further compound of structure (I).
 29. (canceled)
 30. The compound of claim 28, wherein L′ comprises an alkylene oxide or phosphodiester moiety, or combinations thereof.
 31. The compound of claim 30, wherein L′ has the following structure:

wherein: m″ and n″ are independently an integer from 1 to 10; R^(e) is H, an electron pair or a counter ion; and L″ is R^(e) or a direct bond or linkage to: Q, a targeting moiety, an analyte molecule, a solid support, a solid support residue, a nucleoside or a further compound of structure (I).
 32. The compound of claim 29, wherein the targeting moiety is an antibody or cell surface receptor antagonist.
 33. The compound of claim 1, wherein R¹ or R² has one of the following structures:

34-35. (canceled)
 36. The compound of claim 1, wherein Q comprises a sulfhydryl, disulfide, activated ester, isothiocyanate, azide, alkyne, alkene, diene, dienophile, acid halide, sulfonyl halide, phosphine, α-haloamide, biotin, amino or maleimide functional group. 37-38. (canceled)
 39. The compound of claim 1, wherein Q is a moiety having one of the following structures:

wherein: X is halo; and EWG is an electron withdrawing group. 40-45. (canceled)
 46. The compound of claim 1, wherein n is an integer from 1 to 10 and m is an integer from 3 to
 12. 47-55. (canceled)
 56. The compound of claim 1, wherein M¹ and M² are, at each occurrence, independently pyrene, perylene, perylene monoimide or 6-FAM or derivative thereof.
 57. The compound of claim 1, wherein M¹, at each occurrence, independently has one of the following structures:


58. A compound having one of the following structures:

wherein F, F′ and F″ refer to a fluorescein moiety having the following structures, respectively:

R² is

wherein R is H or direct bond.
 59. A method of staining a sample, comprising adding to said sample the compound of claim 1 in an amount sufficient to produce an optical response when said sample is illuminated at an appropriate wavelength. 60-63. (canceled)
 64. A method for visually detecting an analyte molecule, the method comprising: (a) providing the compound of claim 1, wherein R¹ or R² is a linker comprising a covalent bond to the analyte molecule; and (b) detecting the compound by its visible properties.
 65. A method for visually detecting an analyte molecule, the method comprising: (a) admixing the compound of claim 1, wherein R¹ or R² is Q or a linker comprising a covalent bond to Q, with the analyte molecule; (b) forming a conjugate of the compound and the analyte molecule; and (c) detecting the conjugate by its visible properties.
 66. A method for visually detecting an analyte, the method comprising: (a) providing the compound of claim 1, wherein R¹ or R² comprises a linker comprising a covalent bond to a targeting moiety having specificity for the analyte; (b) admixing the compound and the analyte, thereby associating the targeting moiety and the analyte; and (c) detecting the compound by its visible properties.
 67. A composition comprising the compound of claim 1 and one or more analyte molecules.
 68. (canceled)
 69. A compound having the following structure (IIa):

or a stereoisomer, salt or tautomer thereof, wherein: G¹ is, at each occurrence, independently a moiety comprising a reactive group, or protected analogue thereof, capable of forming a covalent bond with a complementary reactive group; L^(1a) is, at each occurrence, independently a heteroarylene linker; L^(1b′), L² and L³ are, at each occurrence, independently optional alkylene, alkenylene, alkynylene, heteroalkylene, heteroalkenylene or heteroalkynylene linkers; L⁴ is, at each occurrence, independently an alkylene, alkenylene, alkynylene, heteroalkylene, heteroalkenylene or heteroalkynylene linker; R¹ and R² are each independently H, OH, SH, alkyl, alkoxy, alkylether, heteroalkyl, —OP(═R_(a))(R_(b))R_(c), Q, or a protected form thereof, or L′; R³ is, at each occurrence, independently H, alkyl or alkoxy; R⁴ is, at each occurrence, independently OH, SH, O⁻, S⁻, OR_(d) or SR_(d); R⁵ is, at each occurrence, independently oxo, thioxo or absent; R_(a) is O or S; R_(b) is OH, SH, O⁻, S⁻, OR_(d) or SR_(d); R_(c) is OH, SH, O⁻, S⁻, OR_(d), OL′, SR_(d), alkyl, alkoxy, heteroalkyl, heteroalkoxy, alkylether, alkoxyalkylether, phosphate, thiophosphate, phosphoalkyl, thiophosphoalkyl, phosphoalkylether or thiophosphoalkylether; R_(d) is a counter ion; Q is, at each occurrence, independently a moiety comprising a reactive group, or protected form thereof, capable of forming a covalent bond with an analyte molecule, a targeting moiety, a solid support or a complementary reactive group Q′; L′ is, at each occurrence, independently a linker comprising a covalent bond to Q, a linker comprising a covalent bond to a targeting moiety, a linker comprising a covalent bond to an analyte molecule, a linker comprising a covalent bond to a solid support, a linker comprising a covalent bond to a solid support residue, a linker comprising a covalent bond to a nucleoside or a linker comprising a covalent bond to a further compound of structure (I); m is, at each occurrence, an integer of one or greater; and n is an integer of one or greater. 70-97. (canceled)
 98. A compound of structure (III) having one of the following structures: 